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Description 

COMPOSITIONS AND METHODS FOR THE TREATMENT 
AND DIAGNOSIS OF BREAST CANCER 

5 

Technical Field 

The present invention relates generally to the detection and therapy of 
breast cancer. The invention is more specifically related to nucleotide sequences that 
are preferentially expressed in breast tumor tissue and to polypeptides encoded by such 
10 nucleotide sequences. The nucleotide sequences and polypeptides may be used in 
vaccines and pharmaceutical compositions for the prevention and treatment of breast 
cancer. The polypeptides may also be used for the production of compounds, such as 
antibodies, useful for diagnosing and monitoring the progression of breast cancer in a 
patient. 

15 

Background of the Invention 

Breast cancer is a significant health problem for women in the United 
States and throughout the world. Although advances have been made in detection and 
treatment of the disease, breast cancer remains the second leading cause of cancer- 
20 related deaths in women, affecting more than 180,000 women in the United States each 
year. For women in North America, the life-time odds of getting breast cancer are now 
one in eight. 

No vaccine or other universally successful method for the prevention or 
treatment of breast cancer is currently available. Management of the disease currently 

25 relies on a combination of early diagnosis (through routine breast screening procedures) 
and aggressive treatment, which may include one or more of a variety of treatments 
such as surgery, radiotherapy, chemotherapy and hormone therapy. The course of 
treatment for a particular breast cancer is often selected based on a variety of prognostic 
parameters, including an analysis of specific tumor markers. See, e.g.. Porter- Jordan 

30 and Lippman, Breast Cancer 5:73-100 (1994). However, the use of established 
markers often leads to a result that is difficult to interpret, and the high mortality 
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observed in breast cancer patients indicates that improvements are needed in the 
treatment, diagnosis and prevention of the disease. 

Accordingly, there is a need in the art for improved methods for therapy 
and diagnosis of breast cancer. The present invention ftilfills these needs and fiirther 
5 provides other related advantages. 

Summarv of the Invention 

Briefly stated, the subject invention provides compositions and methods 
for the diagnosis and therapy of breast cancer. In one aspect, isolated DNA molecules 

10 are provided, comprising (a) a nucleotide sequence preferentially expressed in breast 
cancer tissue, relative to nonnal tissue; (b) a variant of such a sequence that contains 
one or more nucleotide substitutions, deletions, insertions and/or modifications at no 
more than 20% (preferably no more than 5%) of the nucleotide positions, such that the 
antigenic and/or immunogenic properties of the polypeptide encoded by the nucleotide 

15 sequence are retained; or (c) a nucleotide sequence encoding an epitope of a 
polypeptide encoded by at least one of the above sequences. In one embodiment, the 
isolated DNA molecule comprises a human endogenous retroviral sequence recited in 
SEQ ID N0:1. In other embodiments, the isolated DNA molecule comprises a 
nucleotide sequence recited in any one of SEQ ID NO:3 - SEQ ID NO:77 or SEQ ID 

20 NOS: 142, 143, 146-152, 154-166, 168-176, 178-192, 194-198, 200-204, 206, 207, 
209-214,216,218,219, 221-227. 

In related embodiments, the isolated DNA molecule encodes an epitope 
of a polypeptide, wherein the polypeptide is encoded by a nucleotide sequence that: (a) 
hybridizes to a sequence recited in any one of SEQ ID N0:1 or SEQ ID NO;3 - SEQ ID 

25 NO:77 or SEQ ID NOS: 142, 143, 146-152, 154-166, 168-176, 178-192, 194-198, 200- 
204, 206, 207, 209-214, 216, 218, 219, 221-227 under stringent conditions; and (b) is at 
least 80% identical to a sequence recited in any one of SEQ ID N0:1 or SEQ ID N0:3 - 
SEQIDNO:77 or SEQ ID NOS: 142, 143, 146-152, 154-166, 168-176, 178-192, 194- 
198, 200-204, 206, 207, 209-214, 216, 218, 219, 221-227; and wherein RNA 

30 corresponding to said nucleotide sequence is expressed at a greater level in human 
breast tumor tissue than in normal breast tissue. 
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In another embodiment, the present invention provides an isolated DNA 
molecule encoding an epitope of a polypeptide, the polypeptide being encoded by: (a) a 
nucleotide sequence transcribed from the sequence of SEQ ID NO: 1 4 1 ; or (b) a variant 
3 of said nucleotide sequence that contains one or more nucleotide substitutions, 

^ • ^ deletions, insertions and/or modifications at no more than 20% of the nucleotide 

I positions, such that the antigenic and/or immunogenic properties of the polypeptide 

encoded by the nucleotide sequence are retained. Isolated DNA and RNA molecules 
^ comprising a nucleotide sequence complementary to a DNA molecule as described 

above are also provided. 

' ^" f^^'ated aspects, the present invention provides recombinant 

expression vectors comprising a DNA molecule as described above and host cells 
transformed or transfected with such expression vectors. 
^ In further aspects, polypeptides, comprising an amino acid sequence 

n encoded by a DNA molecule as described above, and monoclonal antibodies that bind 

1 5 to such polypeptides are provided. 

In yet another aspect, methods are provided for determining the presence 
of breast cancer in a patient. In one embodiment, the method comprises detecting, 
within a biological sample, a polypeptide as described above. In another embodiment, 
the method comprises detecting, within a biological sample, an RNA molecule 
20 encoding a polypeptide as described above. In yet another embodiment, the method 
comprises (a) intradermally injecting a patient with a polypeptide as described above; 
and (b) detecting an immune response on the patient's skin and therefrom detecting the 
presence of breast cancer in the patient. In further embodiments, the present invention 
^ provides methods for determining the presence of breast cancer in a patient as described 

25 above wherein the polypeptide is encoded by a nucleotide sequence selected from the 
I group consisting of SEQ ID NO.: 78-86, SEQ ID NOS: 144, 145, 153, 167, 177, 193, 

199, 205, 208, 215, 217, 220 and sequences that hybridize thereto under stringent 
conditions. 

In a related aspect, diagnostic kits useful in the determination of breast 
30 cancer are provided. The diagnostic kits generally comprise either one or more 
antibodies as described above, or one or more monoclonal antibodies that 
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bind to a polypeptide encoded by a nucleotide sequence selected from the group 
consisting of sequences provided in SEQ ID NOS: 78-86 and SEQ ID NOS: 144, 145, 
153, 167, 177, 193, 199, 205, 208, 215, 217, 220, and a detection reagent. 

Within a related aspect, the diagnostic kit comprises a first polymerase 
5 chain reaction primer and a second polymerase chain reaction primer, the first and 
second primers each comprising at least about 10 contiguous nucleotides of an RNA 
molecule as described above, or an RNA molecule encoding a polypeptide encoded by 
a nucleotide sequence selected from the group consisting of SEQ ID NOS: 78-86 and 
SEQ ID NOS: 144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220. 

1 0 Within another related aspect, the diagnostic kit comprises at least one 

oligonucleotide probe, the probe comprising at least about 1 5 contiguous nucleotides of 
a DNA molecule as described above, or a DNA molecule selected from the group 
consisting of SEQ ID NOS: 78-86 and SEQ ID NOS: 144, 145, 153, 167, 177, 193, 
199, 205, 208,215,217, 220. 

15 In another related aspect, the present invention provides methods for 

monitoring the progression of breast cancer in a patient. In one embodiment, the 
method comprises: (a) detecting an amount, in a biological sample, of a polypeptide as 
described above at a first point in time; (b) repeating step (a) at a subsequent point in 
time; and (c) comparing the amounts of polypeptide detected in steps (a) and (b), and 

20 therefrom monitoring the progression of breast cancer in the patient. In another 
embodiment, the method comprises (a) detecting an amount, within a biological 
sample, of an RNA molecule encoding a polypeptide as described above at a first point 
in time; (b) repeating step (a) at a subsequent point in time; and (c) comparing the 
amounts of RNA molecules detected in steps (a) and (b), and therefrom monitoring the 

25 progression of breast cancer in the patient. In yet other embodiments, the present 
invention provides methods for monitoring the progression of breast cancer in a patient 
as described above wherein the polypeptide is encoded by a nucleotide sequence 
selected form the group consisting of SEQ ID NO.: 78-86, SEQ ID NOS: 144, 145, 153, 
167, 177, 193, 199, 205, 208, 215, 217, 220 and sequences that hybridize thereto under 

30 stringent conditions. 
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In still other aspects, pharmaceutical compositions, which comprise a 
polypeptide as described above in combination with a physiologically acceptable 
carrier, and vaccines, which comprise a polypeptide as described above in combination 
with an immune response enhancer or adjuvant are provided. In yet other aspects, the 
5 present invention provides pharmaceutical compositions and vaccines comprising a 
polypeptide encoded by a nucleotide sequence selected form the group consisting of 
SEQ ID NO.: 78-86, SEQ ID NOS:144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 
217, 220 and sequences that hybridize thereto under stringent conditions. 

In related aspects, the present invention provides methods for inhibiting 
1 0 the development of breast cancer in a patient, comprising administering to a patient a 
pharmaceutical composition or vaccine as described above. 

These and other aspects of the present invention will become apparent 
upon reference to the following detailed description and attached drawings. All 
15 references disclosed herein are hereby incorporated by reference in their entirety as if 
each was incorporated individually. 

Brief Description of the Drawings 

Figure 1 shows the differential display PCR products, separated by gel 
20 electrophoresis, obtained from cDNA prepared from normal breast tissue (lanes 1 and 
2) and from cDNA prepared from breast tumor tissue from the same patient (lanes 3 
and 4). The arrow indicates the band corresponding to Bl 8Agl . 

Figure 2 is a northern blot comparing the level of BlSAgl mRNA in 
breast tumor tissue (lane 1) with the level in normal breast tissue. 
25 Figure 3 shows the level of B18Agl mRNA in breast tumor tissue 

compared to that in various normal and non-breast tumor tissues as determined by 
RNase protection assays. 

Figure 4 is a genomic clone map showing the location of additional 
retroviral sequences obtained from ends of Xbal restriction digests (provided in SEQ ID 
30 NO:3 - SEQ ID NO: 10) relative to B18Agl. 
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Figures 5A and 5B show the sequencing strategy, genomic organization 
and predicted open reading frame for the retroviral element containing BlSAgl . 

Figure 6 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B 1 8Agl . 

5 Figure 7 shows the nucleotide sequence of the representative breast 

tumor-specific cDNA B17Agl. 

Figure 8 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA Bl 7Ag2. 

Figure 9 shows the nucleotide sequence of the representative breast 
1 0 tumor-specific cDNA B 1 3 Ag2a. 

Figure 10 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B13Aglb. 

Figure 11 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B13Agla. 
•5 Figure 12 shows the nucleotide sequence of the representative breast 

tumor-specific cDNA Bl lAgl. 

Figure 13 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B3CA3c. 

Figure 14 shows the nucleotide sequence of the representative breast 
20 tumor-specific cDNA B9CG 1 . 

Figure 15 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B9CG3. 

Figure 16 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B2CA2. 
25 Figure 17 shows the nucleotide sequence of the representative breast 

tumor-specific cDNA B3CA1 . 

Figure 18 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B3CA2. 

Figure 19 shows the nucleotide sequence of the representative breast 
30 tumor-specific cDNA B3CA3. 



JSDOCID: <WO 972S4g6A3 IB> 



wo 97/25426 



PCT/US97/00485 



Figure 20 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B4CA1. 

Figure 21 A depicts RT-PCR analysis of breast tumor genes in breast 
tumor tissues (lanes 1-8) and normal breast tissues {lanes 9-13) and H.O (lane 14). 
5 Figure 21 B depicts RT-PCR analysis of breast tumor genes in prostate 

tumors (lane 1,2), colon tumors (lane 3), lung tumor (lane 4), normal prostate (lane 5), 
normal colon (lane 6), normal kidney (lane 7), normal liver (lane 8), normal lung (lane 
9), normal ovary (lanes 10, 18), normal pancreases (lanes 11, 12), normal skeletal 
muscle (lane 13), normal skin (lane 14), normal stomach (lane 15), normal testes (lane 
10 16), normal small intestine (lane 17), HBL-100 (lane 19), MCF-12A (lane 20), breast 
tumors (lanes 21-23), Ufi (lane 24), and colon tumor (lane 25). 



Detailed Description of the Invention 

As noted above, the present invention is generally directed to 

15 compositions and methods for the diagnosis, monitoring and therapy of breast cancer. 
The compositions described herein include polypeptides, nucleic acid sequences and 
antibodies. Polypeptides of the present invention generally comprise at least a portion 
of a protein that is expressed at a greater level in human breast tumor tissue than in 
normal breast tissue (i.e., the level of RNA encoding the polypeptide is at least 2-fold 

20 higher in tumor tissue). Such polypeptides are referred to herein as breast tumor- 
specific polypeptides, and cDNA molecules encoding such polypeptides are referred to 
as breast tumor-specific cDNAs. Nucleic acid sequences of the subject invention 
generally comprise a DNA or RNA sequence that encodes all or a portion of a 
polypeptide as described above, or that is complementary to such a sequence. 

25 Antibodies are generally immune system proteins, or firagments thereof, that are 
capable of binding to a portion of a polypeptide as described above. Antibodies can be 
produced by cell culture techniques, including the generation of monoclonal antibodies 
as described herein, or via transfection of antibody genes into suitable bacterial or 
mammalian cell hosts, in order to allow for the production of recombinant antibodies. 

30 Polypeptides within the scope of this invention include, but are not 

limited to, polypeptides (and epitopes thereoO encoded by a human endogenous 
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retroviral sequence, such as the sequence designated BlSAgl (Figure 5 and SEQ ID 
N0:1). Also within the scope of the present invention are polypeptides encoded by 
other sequences within the retroviral genome containing BlSAgl (SEQ ID NO: 141). 
Such sequences include, but are not limited to, the sequences recited in SEQ ID NO:3 - 
5 SEQ ID NO: 10. BlSAgl has homology to the gag p30 gene of the endogenous human 
retroviral element S71, as described in Werner et ai.. Virology 77^:225-238 (1990) and 
also shows homology to about thirty other retroviral gag genes. As discussed in more 
detail below, the present invention also includes a number of additional breast tumor- 
specific polypeptides, such as those encoded by the nucleotide sequences recited in 

10 SEQ ID NO: 11 - SEQ ID NO:77 and SEQ ID NOS:142, 143, 146-152, 154-166, 168- 
176, 178-192, 194-198, 200-204, 206, 207, 209-214, 216, 218, 219, 221-227. As used 
herein, the term "polypeptide" encompasses amino acid chains of any length, including 
full length proteins containing the sequences recited herein. A polypeptide comprising 
an epitope of a protein containing a sequence as described herein may consist entirely 

15 of the epitope, or may contain additional sequences. The additional sequences may be 
derived from the native protein or may be heterologous, and such sequences may (but 
need not) possess immunogenic or antigenic properties. 

An "epitope," as used herein is a portion of a polypeptide that is 
recognized (/.e., specifically bound) by a B-cell and/or T-cell surface antigen receptor. 

20 Epitopes may generally be identified using well known techniques, such as those 
summarized in Paul, Fundamental Immunology, 3rd ed., 243-247 (Raven Press, 1993) 
and references cited therein. Such techniques include screening polypeptides derived 
from the native polypeptide for the ability to react with antigen-specific antisera and/or 
T-cell lines or clones. An epitope of a polypeptide is a portion that reacts with such 

25 antisera and/or T-cells at a level that is similar to the reactivity of the full length 
polypeptide (e.g., in an ELISA and/or T-cell reactivity assay). Such screens may 
generally be performed using methods well known to those of ordinary skill in the art, 
such as those described in Harlow and Lane, Antibodies: A Laboratory Manual, Cold 
Spring Harbor Laboratory, 1988. B-cell and T-cell epitopes may also be predicted via 

30 computer analysis. Polypeptides comprising an epitope of a polypeptide that is 
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preferentially expressed in a tumor tissue (with or without additional amino acid 
sequence) are within the scope of the present invention. 

The compositions and methods of the present invention also encompass 
variants of the above polypeptides and nucleic acid sequences encoding such 
5 polypeptides. A polypeptide "variant," as used herein, is a polypeptide that differs from 
the native polypeptide in substitutions and/or modificaUons, such that the antigenic 
and/or immunogenic properties of the polypeptide are retained. Such variants may 
generally be identified by modifying one of the above polypeptide sequences and 
evaluating the reactivity of the modified polypeptide with antisera and/or T-cells as 

10 described above. Nucleic acid variants may contain one or more substitutions, 
deletions, insertions and/or modifications such that the antigenic and/or immunogenic 
properties of the encoded polypeptide are retained. One preferred variant of the 
polypeptides described herein is a variant that contains nucleotide substitutions, 
deletions, insertions and/or modifications at no more than 20% of the nucleotide 

15 positions. 

Preferably, a variant contains conservative substitutions. A 
"conservative substitution" is one in which an amino acid is substituted for another 
amino acid that has similar properties, such that one skilled in the art of peptide 
chemistry would expect the secondary structure and hydropathic nature of the 
20 polypeptide to be substantially unchanged. In general, the following groups of amino 
acids represem conservative changes: (I)ala, pro, gly, glu, asp, gin, asn, ser, thr; 
(2) cys, ser, tyr, thr; (3) val, ile, leu, met, ala, phe; (4) lys, arg, his; and (5) phe, tyr, trp, 
his. 

Variants may also (or alternatively) be modified by, for example, the 
25 deletion or addition of amino acids that have minimal influence on the immunogenic or 
antigenic properties, secondary structure and hydropathic nature of the polypeptide. 
For example, a polypeptide may be conjugated to a signal (or leader) sequence at the N- 
terminai end of the protein which co-translationally or post-translationally directs 
transfer of the protein. The polypeptide may also be conjugated to a linker or other 
30 sequence for ease of synthesis, purification or identification of the polypeptide {e.g.. 
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poly-His), or to enhance binding of the polypeptide to a solid support. For example, a 
polypeptide may be conjugated to an immunoglobulin Fc region. 

In general, nucleotide sequences encoding all or a portion of the 
polypeptides described herein may be prepared using any of several techniques. For 
5 example, cDNA molecules encoding such polypeptides may be cloned on the basis of 
the breast tumor-specific expression of the corresponding mRNAs, using differential 
display PGR. This technique compares the amplified products firom RNA template 
prepared from normal and breast tumor tissue. cDNA may be prepared by reverse 
transcription of RNA using a (dT),2AG primer. Following amplification of the cDNA 

10 using a random primer, a band corresponding to an amplified product specific to the 
tumor RNA may be cut out from a silver stained gel and subcloned into a suitable 
vector (e.g., the T- vector, Novagen, Madison, WI). Nucleotide sequences encoding all 
or a portion of the breast tumor-specific polypeptides disclosed herein may be amplified 
from cDNA prepared as described above using the random primers shown in SEQ ID 

15 NO.:87-125. 

Alternatively, a gene encoding a polypeptide as described herein (or a 
portion thereof) may be amplified from human genomic DNA, or from breast tumor 
cDNA, via polymerase chain reaction. For this approach, BlSAgl sequence-specific 
primers may be designed based on the sequence provided in SEQ ID N0:1, and may be 

20 purchased or synthesized. One suitable primer pair for amplification from breast tumor 
cDNA is (5'ATG GCT ATT TTC GOG GGC TGA CA) (SEQ ID NO.: 126) and 
(5'CCG GTA TCT CCT CGT GGG TAT T) (SEQ ID NO.:127). An amplified portion 
of BlSAgl may then be used to isolate the fiill length gene from a himian genomic 
DNA library or from a breast tumor cDNA library, using well known techniques, such 

25 as those described in Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor Laboratories, Cold Spring Harbor, NY (1989). Other sequences within 
the retroviral genome of which BlSAgl is a part may be similarly prepared by 
screening human genomic libraries using BlSAgl -specific sequences as probes. 
Nucleotides translated into protein from the retroviral genome shown in SEQ ID NO: 

30 141 may then be determined by cloning the corresponding cDNAs, predicting the open 
reading frames and cloning the appropriate cDNAs into a vector containing a viral 



^4S0OCID: <WO_97254aeA3JB> 



wo 97/25426 



PCT/US97/00485 



]] 

promoter, such as T7. The resulting constructs can be employed in a translation 
reaction, using techniques known to those of skill in the art, to identify nucleotide 
sequences which result in expressed protein. Similarly, primers specific for the 
remaining breast tumor-specific polypeptides described herein may be designed based 
5 on the nucleotide sequences provided in SEQ ID NO: 1 1 - SEQ ID NO:86 and SEQ ID 
NO:142-SEQ ID NO:226. 

Recombinant polypeptides encoded by the DNA sequences described 
above may be readily prepared from the DNA sequences. For example, supematants 
from suitable host/vector systems which secrete recombinant protein or polypeptide 
10 into culture media may be first concentrated using a commercially available filter. 
Following concentration, the concentrate may be applied to a suitable purification 
matrix such as an affinity matrix or an ion exchange resin. Finally, one or more reverse 
phase HPLC steps can be employed to further purify a recombinant polypeptide. 

In general, any of a variety of expression vectors known to those of 
1 5 ordinary skill in the art may be employed to express recombinant polypeptides of this 
invention, Expression may be achieved in any appropriate host cell that has been 
transformed or transfected with an expression vector containing a DNA molecule that 
encodes a recombinant polypeptide. Suitable host cells include prokaryotes, yeast and 
higher eukaryotic cells. Preferably, the host cells employed are £ colL yeast or a 
20 mammalian cell line such as COS or CHO. 

Such techniques may also be used to prepare polypeptides comprising 
epitopes or variants of the native polypeptides. For example, variants of a native 
polypeptide may generally be prepared using standard mutagenesis techniques, such as 
oligonucleotide-directed site-specific mutagenesis, and sections of the DNA sequence 
25 may be removed to permit preparation of truncated polypeptides. Portions and other 
variants having fewer than about 100 amino acids, and generally fewer than about 50 
amino acids, may also be generated by synthetic means, using techniques well known 
to those of ordinary skill in the art. For example, such polypeptides may be symhesized 
using any of the commercially available solid-phase techniques, such as the Merrifield 
30 solid-phase synthesis method, where amino acids are sequentially added to a growing 
amino acid chain. See Merrifield, J. Am. Chem. Soc. 55:2149-2146 (1963). Equipment 
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for automated synthesis of polypeptides is commercially available from suppliers such 
as Applied BioSystems, Inc., Foster City, CA, and may be operated according to the 
manufacturer's instructions. 

In specific embodiments, polypeptides of the present invention 
5 encompass amino acid sequences encoded by a DNA molecule having a sequence 
recited in any one of SEQ ID N0:1 or SEQ ID N0:3 - SEQ ID NO:77 or SEQ ID 
NOS:142, 143, 146-152, 154-166, 168.176, 178-192, 194-198, 200-204, 206, 207, 209- 
214, 216, 218, 219, 221-227, variants of such polypeptides that are encoded by DNA 
molecules containing one or more nucleotide substitutions, deletions, insertions and/or 

10 modifications at no more than 20% of the nucleotide positions, and epitopes of the 
above polypeptides. Polypeptides within the scope of the present invention also include 
polypeptides (and epitopes thereof) encoded by DNA sequences that hybridize to a 
DNA molecule having a sequence recited in any one of SEQ ID NO: 1 or SEQ ID N0:3 
- SEQ IDNO:77 or SEQ ID NOS:142, 143, 146-152, 154-166, 168-176, 178-192, 194- 

15 198, 200-204, 206, 207, 209-214, 216, 218, 219, 221-227 under suingent conditions, 
wherein the DNA sequences are at least 80% identical in overall sequence to a recited 
sequence and wherein RNA corresponding to the nucleotide sequence is expressed at a 
greater level in human breast tumor tissue than in normal breast tissue. As used herein, 
"stringent conditions" refers to prewashing in a solution of 6X SSC, 0.2% SDS; 

20 hybridizing at 65^C, 6X SSC, 0.2% SDS ovemight; followed by two washes of 30 
minutes each in IX SSC, 0.1% SDS at 65°C and two washes of 30 minutes each in 1 X 
SSC, 0.1% SDS at 65**C. DNA molecules according to the present invention include 
molecules that encode any of the above polypeptides. 

In another aspect of the present invention, antibodies are provided. Such 

25 antibodies may be prepared by any of a variety of techniques known to those of 
ordinary skill in the art. See, e.g., Harlow and Lane, Antibodies: A Laboratory 
Manual J Cold Spring Harbor Laboratory, 1988. In one such technique, an immxmogen 
comprising the polypeptide is initially injected into any of a wide variety of mammals 
{e.g.y mice, rats, rabbits, sheep or goats). In this step, the polypeptides of this invention 

30 may serve as the immunogen without modification. Alternatively, particularly for 
relatively short polypeptides, a superior immune response may be elicited if the 
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polypeptide is joined to a carrier protein, such as bovine serum albumin or keyhole 
limpet hemocyanin. The immunogen is injected into the animal host, preferably 
according to a predetermined schedule incorporating one or more booster 
immunizations, and the animals are bled periodically. Polyclonal antibodies specific 
5 for the polypeptide may then be purified from such antisera by, for example, affinity 
chromatography using the polypeptide coupled to a suitable solid support. 

Monoclonal antibodies specific for the antigenic polypeptide of interest 
may be prepared, for example, using the technique of Kohler and Milstein, Eur 1 
Immunol 6:5 11 -5 19 (1976), and improvements thereto. Briefly, these methods involve 

10 the preparation of inunortal cell lines capable of producing antibodies having the 
desired specificity {/.e., reactivity with the polypeptide of interest). Such cell lines may 
be produced, for example, from spleen cells obtained from an animal immunized as 
described above. The spleen cells are then immortalized by, for example, fusion with a 
myeloma cell fusion partner, preferably one that is syngeneic with the immunized 

15 animal. A variety of fusion techniques may be employed. For example, the spleen 
cells and myeloma cells may be combined with a nonionic detergent for a few minutes 
and then plated at low density on a selective medium that supports the growth of hybrid 
cells, but not myeloma cells. A preferred selection technique uses HAT (hypoxanthine, 
aminopterin, thymidine) selection. After a sufficient time, usually about 1 to 2 weeks, 

20 colonies of hybrids are observed. Single colonies are selected and their culture 
supematants tested for binding activity against the polypeptide. Hybridomas having 
high reactivity and specificity are preferred. 

Monoclonal antibodies may be isolated from the supematants of 
growing hybridoma colonies. In addition, various techniques may be employed to 

25 enhance the yield, such as injection of the hybridoma cell line into the peritoneal cavity 
of a suitable vertebrate host, such as a mouse. Monoclonal antibodies may then be 
harvested from the ascites fluid or the blood. Contaminants may be removed from the 
antibodies by conventional techniques, such as chromatography, gel filtration, 
precipitation, and extraction. The polypeptides of this invention may be used in the 

30 purification process in, for example, an affinity chromatography step. 
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Antibodies may be used, for example, in methods for detecting breast 
cancer in a patient. Such methods involve using an antibody to detect the presence or 
absence of a breast tumor-specific polypeptide as described herein in a suitable 
biological sample. As used herein, suitable biological samples include tumor or normal 
5 tissue biopsy, mastectomy, blood, lymph node, serum or urine samples, or other tissue, 
homogenate, or extract thereof obtained firom a patient. 

There are a variety of assay formats known to those of ordinary skill in 
the art for using an antibody to detect polypeptide markers in a sample. See, e.g., 
Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 

10 1988. For example, the assay may be performed in a Western blot format, wherein a 
protein preparation from the biological sample is submitted to gel electrophoresis, 
transferred to a suitable membrane and allowed to react with the antibody. The 
presence of the antibody on the membrane may then be detected using a suitable 
detection reagent, as described below. 

15 In another embodiment, the assay involves the use of antibody 

immobilized on a solid support to bind to the polypeptide and remove it from the 
remainder of the sample. The bound polypeptide may then be detected using a second 
antibody or reagent that contains a reporter group. Alternatively, a competitive assay 
may be utilized, in which a polypeptide is labeled with a reporter group and allowed to 

20 bind to the immobilized antibody after incubation of the antibody with the sample. The 
extent to which components of the sample inhibit the binding of the labeled polypeptide 
to the antibody is indicative of the reactivity of the sample with the immobilized 
antibody, and as a result, indicative of the concentration of polypeptide in the sample. 

The solid support may be any material known to those of ordinary skill 

25 in the art to which the antibody may be attached. For example, the solid support may 
be a test well in a microtiter plate or a nitrocellulose filter or other suitable membrane. 
Alternatively, the support may be a bead or disc, such as glass, fiberglass, latex or a 
plastic material such as polystyrene or polyvinylchloride. The support may also be a 
magnetic particle or a fiber optic sensor, such as those disclosed, for example, in U.S. 

30 Patent No. 5,359,681. 
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The antibody may be immobilized on the solid support using a variety of 
techniques known to those in the art, which are amply described in the patent and 
scientific literature. In the context of the present invention, the term "immobilization" 
refers to both noncovalent association, such as adsorption, and covalent attachment 
5 (which may be a direct linkage between the antigen and ftmctional groups on the 
support or may be a linkage by way of a cross-linking agent). Immobilization by 
adsorption to a well in a microliter plate or to a membrane is preferred. In such cases, 
adsorption may be achieved by contacting the antibody, in a suitable buffer, with the 
solid support for a suitable amount of time. The contact time varies with temperature, 
10 but is typically between about 1 hour and I day. In general, contacting a well of a 
plastic microliter plate (such as polystyrene or polyvinylchloride) with an amount of 
antibody ranging from about lOng to about 1 ^g, and preferably about 100-200 ng, is 
sufficient to immobilize an adequate amount of polypeptide. 

Covalent attachment of antibody to a solid support may also generally be 
15 achieved by first reacting the support with a bifunclional reagent that will react with 
both the support and a functional group, such as a hydroxyl or amino group, on the 
antibody. For example, the antibody may be covalently attached to supports having an 
appropriate polymer coating using benzoquinone or by condensation of an aldehyde 
group on the support with an amine and an active hydrogen on the binding partner {see. 
20 e.^.. Pierce Immunotechnology Catalog and Handbook (1991) at A12-A13). 

In certain embodiments, the assay for detection of polypeptide in a 
sample is a two-antibody sandwich assay. This assay may be performed by first 
contacting an antibody that has been immobilized on a solid support, commonly the 
well of a microtiter plate, with the biological sample, such that the polypeptide within 
25 the sample are allowed to bind to the inunobilized antibody. Unbound sample is then 
removed from the immobilized polypeptide-antibody complexes and a second antibody 
(containing a reporter group) capable of binding to a different site on the polypeptide is 
added. The amount of second antibody that remains bound to the solid support is then 
determined using a method appropriate for the specific reporter group. 
30 More specifically, once the antibody is immobilized on the support as 

described above, the remaining protein binding sites on the support are typically 
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blocked. Any suitable blocking agent known to those of ordinary skill in the art, such 
as bovine serum albumin or Tween 20™ (Sigma Chemical Co., St. Louis, MO). The 
immobilized antibody is then incubated with the sample, and polypeptide is allowed to 
bind to the antibody. The sample may be diluted with a suitable diluent, such as 
5 phosphate-buffered saline (PBS) prior to incubation. In general, an appropriate contact 
time {i.e., incubation time) is that period of time that is sufficient to detect the presence 
of polypeptide within a sample obtained from an individual with breast cancer. 
Preferably, the contact time is sufficient to achieve a level of binding that is at least 
95% of that achieved at equilibrium between bound and unbound polypeptide. Those 
1 0 of ordinary skill in the art will recognize that the time necessary to achieve equilibrium 
may be readily determined by assaying the level of binding that occurs over a period of 
time. At room temperature, an incubation time of about 30 minutes is generally 
sufficient. 

Unbound sample may then be removed by washing the solid support 
15 with an appropriate buffer, such as PBS containing 0.1% Tween 20™. The second 
antibody, which contains a reporter group, may then be added to the solid support. 
Preferred reporter groups include enzymes (such as horseradish peroxidase), substrates, 
cofactors, inhibitors, dyes, radionuclides, luminescent groups, fluorescent groups and 
biotin. The conjugation of amibody to reporter group may be achieved using standard 

20 methods known to those of ordinary skill in the art. 

The second antibody is then incubated with the immobilized antibody- 
polypeptide complex for an amount of time sufficient to detect the bound polypeptide. 
An appropriate amount of time may generally be determined by assaying the level of 
binding that occurs over a period of time. Unbound second antibody is then removed 

25 and bound second antibody is detected using the reporter group. The method employed 
for detecting the reporter group depends upon the nature of the reporter group. For 
radioactive groups, scintillation counting or autoradiographic methods are generally 
appropriate. Spectroscopic methods may be used to detect dyes, luminescent groups 
and fluorescent groups. Biotin may be detected using avidin. coupled to a different 

30 reporter group (commonly a radioactive or fluorescent group or an enzyme). Enzyme 
reporter groups may generally be detected by the addition of substrate (generally for a 
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specific period of time), followed by spectroscopic or other analysis of the reaction 
products. 

To determine the presence or absence of breast cancer, the signal 
detected from the reporter group that remains bound to the solid support is generally 
5 compared to a signal that corresponds to a predetermined cut-off value established from 
non-tumor tissue. In one preferred embodiment, the cut-off value is the average mean 
signal obtained when the immobilized antibody is incubated with samples from patients 
without breast cancer. In general, a sample generating a signal that is three standard 
deviations above the predetermined cut-off value may be considered positive for breast 
10 cancer. In an alternate preferred embodiment, the cut-off value is determined using a 
Receiver Operator Curve, according to the method of Sackett et al.. Clinical 
Epidemiology: A Basic Science for Clinical Medicine, p. 106-7 (Little Brown and Co., 
1985). Briefly, in this embodiment, the cut-off value may be determined from a plot of 
pairs of true positive rates {i.e., sensitivity) and false positive rates (100%-specificity) 
1 5 that correspond to each possible cut-off value for the diagnostic test result. The cut-off 
value on the plot that is the closest to the upper left-hand comer {i.e., the value that 
encloses the largest area) is the most accurate cut-off value, and a sample generating a 
signal that is higher than the cut-off value determined by this method may be 
considered positive. Alternatively, the cut-off value may be shifted to the left along the 
20 plot, to minimize the false positive rate, or to the right, to minimize the faise negative 
rate. In general, a sample generating a signal that is higher than the cut-off value 
determined by this method is considered positive for breast cancer. 

In a related embodiment, the assay is performed in a flow-through or 
strip test format, wherein the antibody is immobilized on a membrane, such as 
25 nitrocellulose. In the flow-through test, the polypeptide within the sample bind to the 
immobilized antibody as the sample passes through the membrane. A second, labeled 
antibody then binds to the antibody-polypeptide complex as a solution containing the 
second antibody flows through the membrane. The detection of bound second antibody 
may then be performed as described above. In the strip test format, one end of the 
30 membrane to which antibody is bound is immersed in a solution containing the sample. 
The sample migrates along the membrane through a region containing second antibody 
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and to the area of immobilized antibody. Concentration of second antibody at the area 
of immobilized antibody indicates the presence of breast cancer. Typically, the 
concentration of second antibody at that site generates a pattern, such as a line, that can 
be read visually. The absence of such a pattern indicates a negative result. In general, 
5 the amount of antibody immobilized on the membrane is selected to generate a visually 
discernible pattern when the biological sample contains a level of polypeptide that 
would be sufficient to generate a positive signal in the two-antibody sandwich assay, in 
the format discussed above. Preferably, the amount of antibody inunobilized on the 
membrane ranges from about 25 ng to about 1 ^g, and more preferably from about 50 
10 ng to about 1 ng. Such tests can typically be performed with a very small amount of 
biological sample. 

The presence or absence of breast cancer in a patient may also be 
determined by evaluating the level of mRNA encoding a breast tumor-specific 
polypeptide as described herein within the biological sample {e.g.. a biopsy, 

15 mastectomy and/or blood sample fi*om a patient) relative to a predetermined cut-off 
value. Such an evaluation may be achieved using any of a variety of methods known to 
those of ordinary skill in the art such as, for example, in situ hybridization and 
amplification by polymerase chain reaction. For example, polymerase chain reaction 
may be used to amplify sequences fi-om cDNA prepared from RNA that is isolated from 

20 one of the above biological samples. Sequence-specific primers for use in such 
amplification may be designed based on the sequences provided in any one of SEQ ID 
N0:1 or SEQ ID NO: 11 - SEQ ID NO:86 and SEQ ID NO:142-SEQ ID NO:226, and 
may be purchased or synthesized. In the case of BlSAgl, as noted herein, one suitable 
primer pair is B18Agl-2 (5'ATG GCT ATT TTC GGG GGC TGA CA) (SEQ ID 

25 NO.:126) and B18Agl-3 (5'CCG GTA TCT CCT CGT GGG TAT T) (SEQ ID 
NO.: 127). The PGR reaction products may then be separated by gel electrophoresis 
and visualized according to methods well known to those of ordinary skill in the art. 
Amplification is typically performed on samples obtained ft-om matched pairs of tissue 
(tumor and non-tumor tissue from the same individual) or from unmatched pairs of 

30 tissue (tumor and non-tumor tissue from different individuals). The amplification 
reaction is preferably performed on several dilutions of cDNA spanning two orders of 
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magnitude. A two-fold or greater increase in expression in several dilutions of the 
tumor sample as compared to the same dilution of the non-tumor sample is considered 
positive. 

Conventional RT-PCR protocols using agarose and ethidium bromide 
5 staining while important in defining gene specificity do not lend themselves to 
diagnostic kit development because of the time and effort required in making them 
quantitative (i.e., construction of saturation and/or titration curves), and their sample 
throughput. This problem is overcome by the development of procedures such as real 
time RT-PCR which allows for assays to be performed in single tubes, and in turn can 
10 be modified for use in 96 well plate formats. Instrumentation to perform such 
methodologies are available from ABI/Perkin Elmer. Alternatively, other high 
throughput assays using labelled probes (e.g., digoxygenin) in combination with 
labelled (e.g., enzyme fluorescent, radioactive) antibodies to such probes can also be 
used in the development of 96 well plate assays. 

^ 5 In yet another method for determining the presence or absence of breast 

cancer in a patient, one or more of the breast tumor-specific polypeptides described 
may be used in a skin test. As used herein, a "skin test" is any assay performed directly 
on a patient in which a delayed-type hypersensitivity (DTH) reaction (such as swelling, 
reddening or dermatitis) is measured following intradermal injection of one or more 

20 polypeptides as described above. Such injection may be achieved using any suitable 
device sufficient to contact the polypeptide or polypeptides with dermal cells of the 
patient, such as a tuberculin syringe or 1 mL syringe. Preferably, the reaction is 
measured at least 48 hours after injection, more preferably 48-72 hours. 

The DTH reaction is a cell-mediated immune response, which is greater 

25 in patients that have been exposed previously to a test antigen (i.e., an immunogenic 
portion of a polypeptide employed, or a variant thereof). The response may measured 
visually, using a ruler. In general, a response that is greater than about 0.5 cm in 
diameter, preferably greater than about 5.0 cm in diameter, is a positive response, 
indicative of breast cancer. 

^0 The breast tumor-specific polypeptides described herein are preferably 

formulated, for use in a skin test, as pharmaceutical compositions containing at least 
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one polypeptide and a physiologically acceptable carrier, such as water, saline, alcohol, 
or a buffer. Such compositions typically contain one or more of the above polypeptides 
in an amount ranging from about 1 fig to 100 ng, preferably from about 10 fig to 50 ^g 
in a volume of 0,1 mL. Preferably, the carrier employed in such pharmaceutical 
5 compositions is a saline solution with appropriate preservatives, such as phenol and/or 
Tween 80^". 

In other aspects of the present invention, the progression and/or response 
to treatment of a breast cancer may be monitored by performing any of the above assays 
over a period of time, and evaluating the change in the level of the response (i.e., the 

1 0 amount of polypeptide or mRN A detected or, in the case of a skin test, the extent of the 
immune response detected). For example, the assays may be performed every month to 
every other month for a period of 1 to 2 years. In general, breast cancer is progressing 
in those patients in whom the level of the response increases over time. In contrast, 
breast cancer is not progressing when the signal detected either remains constant or 

1 5 decreases with time. 

In further aspects of the present invention, the compounds described 
herein may be used for the immunotherapy of breast cancer. In these aspects, the 
compounds (which may be polypeptides, antibodies or nucleic acid molecules) are 
preferably incorporated into pharmaceutical compositions or vaccines. Pharmaceutical 

20 compositions comprise one or more such compounds and a physiologically acceptable 
carrier. Vaccines may comprise one or more polypeptides and an immune response 
enhancer, such as an adjuvant or a liposome (into which the compound is incorporated). 
Pharmaceutical compositions and vaccines may additionally contain a delivery system, 
such as biodegradable microspheres which are disclosed, for example, in U.S. Patent 

25 Nos. 4,897,268 and 5,075,109. Pharmaceutical compositions and vaccines within the 
scope of the present invention may also contain other compounds, including one or 
more separate polypeptides. 

Alternatively, a vaccine may contain DNA encoding one or more of the 
polypeptides as described above, such that the polypeptide is generated in situ. In such 

30 vaccines, the DNA may be present within any of a variety of delivery systems known to 
those of ordinary skill in the art, including nucleic acid expression systems, bacteria and 
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viral expression systems. Appropriate nucleic acid expression systems contain the 
necessary DNA sequences for expression in the patient (such as a suitable promoter and 
terminating signal). Bacterial delivery systems involve the administration of a 
bacterium (such as Bacillus-Calmette-Guerrm) that expresses an immunogenic ponion 

5 of the polypeptide on its cell surface. In a preferred embodiment, the DNA may be 
introduced using a viral expression system (e.^., vaccinia or other pox virus, retrovirus, 
or adenovirus), which may involve the use of a non-pathogenic (defective), replication 
competent virus. Techniques for incorporating DNA into such expression systems are 
well known to those of ordinary skill in the art. The DNA may also be "naked," as 

0 described, for example, in Ulmer et al., Science 259:1745-1749 (1993), and reviewed 
by Cohen, Science 259:1691-1692 (1993). The uptake of naked DNA may be increased 
by coating the DNA onto biodegradable beads, which are efficiently transported into 
the cells. 

While any suitable carrier known to those of ordinary skill in the art may 

5 be employed in the pharmaceutical compositions of this invention, the type of carrier 
will vary depending on the mode of administration. For parenteral administration, such 
as subcutaneous injection, the carrier preferably comprises water, saline, alcohol, a fat, 
a wax or a buffer. For oral administration, any of the above carriers or a solid carrier, 
such as mannitol, lactose, starch, magnesium stearate, sodium saccharine, talcum, 

) cellulose, glucose, sucrose, and magnesium carbonate, may be employed. 
Biodegradable microspheres (e.g., polylactate polyglycolate) may also be employed as 
carriers for the pharmaceutical compositions of this invention. 

Any of a variety of adjuvants may be employed in the vaccines of this 
invention to nonspecifically enhance the immune response. Most adjuvants contain a 

5 substance designed to protect the antigen from rapid catabolism, such as aluminum 
hydroxide or mineral oil, and a stimulator of immune responses, such as lipid A, 
Bortadella pertussis or Mycobacterium tuberculosis derived proteins. Suitable 
adjuvants are commercially available as, for example, Freund's Incomplete Adjuvant 
and Complete Adjuvant (Difco Laboratories, Detroit, MI), Merck Adjuvant 65 (Merck 

) and Company, Inc., Rahway, NJ), alum, biodegradable microspheres, monophosphoryl 
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lipid A and quil A. Cytokines, such as GM-CSF or interleukin-2, -7, or -12, may also 
be used as adjuvants. 

The above pharmaceutical compositions and vaccines may be used, for 
example, for the therapy of breast cancer in a patient. As used herein, a "patient" refers 

5 to any warm-blooded animal, preferably a human. A patient may or may not be 
afflicted with breast cancer. Accordingly, the above pharmaceutical compositions and 
vaccines may be used to prevent the development of breast cancer or to treat a patient 
afflicted with breast cancer. To prevent the development of breast cancer, a 
pharmaceutical composition or vaccine comprising one or more polypeptides as 

10 described herein may be administered to a patient. Alternatively, naked DNA or 
plasmid or viral vector encoding the polypeptide may be administered. For treating a 
patient with breast cancer, the pharmaceutical composition or vaccine may comprise 
one or more polypeptides, antibodies or nucleotide sequences complementary to DNA 
encoding a polypeptide as described herein (e.g., antisense RNA or antisense 

1 5 deoxyribonucleotide oligonucleotides). 

Routes and frequency of administration, as well as dosage, will vary 
from individual to individual. In general, the pharmaceutical compositions and 
vaccines may be administered by injection (e.g., intracutaneous, intramuscular, 
intravenous or subcutaneous), intranasally (e.g., by aspiration) or orally. Between 1 

20 and 10 doses may be administered for a 52-week period. Preferably, 6 doses are 
administered, at intervals of 1 month, and booster vaccinations may be given 
periodically thereafter. Alternate protocols may be appropriate for individual patients. 
A suitable dose is an amount of a compound that, when administered as described 
above, is capable of promoting an anti-timior immune response. Such response can be 

25 monitored by measuring the anti-tumor antibodies in a patient or by vaccine-dependent 
generation of cytolytic effector cells capable of killing the patient's tumor cells in vitro. 
Such vaccines should also be capable of causing an inruntme response that leads to an 
improved clinical outcome (e.g., more frequent remissions, complete or partial or 
longer disease-free survival) in vaccinated patients as compared to non-vaccinated 

30 patients. In general, for pharmaceutical compositions and vaccines comprising one or 
more polypeptides, the amount of each polypeptide present in a dose ranges from about 
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100 ^^g to 5 mg. Suitable dose sizes will vary with the size of the patient, but will 
typically range from about 0.1 mL to about 5 mL. 

The following Examples are offered by way of illustration and not by 
5 way of limitation. 
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EXAMPLES 
Example 1 

Preparation of Breast Tumor-Specific cDNAs Using Differential Display RT-PCR 

5 

This Example illustrates the preparation of cDNA molecules encoding 
breast tumor-specific polypeptides using a differential display screen. 

A. Preparation of Bl 8Agl cDNA and Characterization of mRNA Expression 

10 Tissue samples were prepared from breast tumor and normal tissue of a 

patient with breast cancer that was confirmed by pathology after removal from the 
patient. Normal RNA and tumor RNA was extracted from the samples and mRNA was 
isolated and converted into cDNA using a (dT)j2AG (SEQ ID NO.: 130) anchored 3' 
primer. Differential display PCR was then executed using a randomly chosen primer 

15 (CTTCAACCTC) (SEQ ID NO.:103). Amplification conditions were standard buffer 
containing 1.5 mM MgClj, 20 pmol of primer, 500 pmol dNTP, and 1 unit of Tag DNA 
polymerase (Perkin-Elmer, Branchburg, NJ). Forty cycles of amplification were 
performed using 94**C denaturation for 30 seconds, 42**C annealing for 1 minute, and 
ITC extension for 30 seconds. An RNA fingerprint containing 76 amplified products 

20 was obtained. Although the RNA fingerprint of breast tumor tissue was over 98% 
identical to that of the normal breast tissue, a band was repeatedly observed to be 
specific to the RNA fingerprint pattern of the tumor. This band was cut out of a silver 
stained gel, subcloned into the T-vector (Novagen, Madison, WI) and sequenced. 

The sequence of the cDNA, referred to as BlSAgl, is provided in SEQ 

25 ID N0:1. A database search of GENBANK and EMBL revealed that the BlSAgl 
fragment initially cloned is 77% identical to the endogenous human retroviral element 
S71, which is a truncated retroviral element homologous to the Simian Sarcoma Virus 
(SSV). S71 contains an incomplete gag gene, a portion of the pol gene and an LTR- 
like structure at the 3' terminus {see Werner et al. Virology 77^:225-238 (1990)). 

30 BlSAgl is also 64% identical to SSV in the region corresponding to the P30 (gag) 
locus. BlSAgl contains three separate and incomplete reading frames covering a 
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region which shares considerable homology to a wide variety of gag proteins of 
retroviruses which infect mammals. In addition, the homology to S71 is not just within 
the gag gene, but spans several kb of sequence including an LTR, 

BlSAgl -specific PGR primers were synthesized using computer 
5 analysis guidelines. RT-PCR amplification (94T, 30 seconds; 60°C 42^C, 30 
seconds; 30 seconds for 40 cycles) confirmed that BlSAgl represents an actual 
mRNA sequence present at relatively high levels in the patient's breast tumor tissue. 
The primers used in amplification were B18Agl-l (CTG CCT GAG CCA CAA ATG) 
(SEQ ID NO.:I28) and B18Agl-4 (CCG GAG GAG GAA GCT AGA GGA ATA) 
10 (SEQ ID NO.:129) at a 3.5 mM magnesium concentration and a pH of 8.5, and 
B18Agl-2 (ATG GCT ATT TTC GGG GCC TGA CA) (SEQ ID NO.:126) and 
B18Agl-3 (CCG GTA TCT CCT CGT GGT TATT) (SEQ ID NO.:I27) at 2 mM 
magnesium at pH 9.5. The same experiments showed exceedingly low to nonexistent 
levels of expression in this patient's normal breast tissue {see Figure 1). RT-PCR 
1 5 experiments were then used to show that B 1 8Ag 1 mRNA is present in nine other breast 
tumor samples (from Brazilian and American patients) but absent in, or at exceedingly 
low levels in, the normal breast tissue corresponding to each cancer patient. RT-PCR 
analysis has also shown that the BlSAgl transcript is not present in various normal 
tissues (including lymph node, myocardium and liver) and present at relatively low 
20 levels in PBMC and lung tissue. The presence of BlSAgl mRNA in breast tumor 
samples, and its absence from normal breast tissue, has been confirmed by Northern 
blot analysis, as shovm in Figure 2. 

The differential expression of BlSAgl in breast tumor tissue was also 
confirmed by RNase protection assays. Figure 3 shows the level of BlSAgl mRNA in 
25 various tissue types as determined in four different RNase protection assays. Lanes 1- 
12 represent various normal breast tissue samples, lanes 13-25 represent various breast 
tumor samples; lanes 26-27 represent normal prostate samples; lanes 28-29 represent 
prostate tumor samples; lanes 30-32 represent colon tumor samples; lane 33 represents 
normal aorta; lane 34 represents normal small intestine; lane 35 represents normal skin, 
30 lane 36 represents normal lymph node; lane 37 represents normal ovary; lane 38 
represents normal liver; lane 39 represents normal skeletal muscle; lane 40 represents a 
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first normal stomach sample, lane 41 represents a second normal stomach sample; lane 
42 represents a normal lung; lane 43 represents normal kidney; and lane 44 represents 
normal pancreas. Interexperimental comparison was facilitated by including a 
positive control RNA of known B-actin message abundance in each assay and 
5 normalizing the results of the different assays with respect to this positive control. 

RT-PCR and Southern Blot analysis has shown the B18Agl locus to be 
present in human genomic DNA as a single copy endogenous retroviral element. A 
genomic clone of approximately 12-18 kb was isolated using the initial BlSAgl 
sequence as a probe. Four additional subclones were also isolated by Xbal digestion. 

10 Additional retroviral sequences obtained from the ends of the Xbal digests of these 
clones (located as shown in Figure 4) are shown as SEQ ID N0:3 - SEQ ID NO: 10, 
where SEQ ID N0:3 shows the location of the sequence labeled 10 in Figure 4, SEQ ID 
N0:4 shows the location of the sequence labeled 11-29, SEQ ID N0:5 shows the 
location of the sequence labeled 3, SEQ ID N0:6 shows the location of the sequence 

15 labeled 6, SEQ ID N0:7 shows the location of the sequence labeled 12, SEQ ID N0:8 
shows the location of the sequence labeled 13, SEQ ID N0:9 shows the location of the 
sequence labeled 14 and SEQ ID NO: 10 shows the location of the sequence labeled 1 1- 
22. 

Subsequent studies demonstrated that the 12-18 kb genomic clone 
20 contains a retroviral element of about 7.75 kb, as shown in Figures 5A and 5B. The 
sequence of this retroviral element is shown in SEQ ID NO: 141 . The numbered line at 
the top of Figure 5A represents the sense strand sequence of the retroviral genomic 
clone. The box below this line shows the position of selected restriction sites. The 
arrows depict the different overlapping clones used to sequence the retroviral element. 
25 The direction of the anow shows whether the single-pass subclone sequence 
corresponded to the sense or anti-sense strand. Figure 5B is a schematic diagram of the 
retroviral element containing BlSAgl depicting the organization of viral genes within 
the element. The open boxes correspond to predicted reading frames, starting with a 
methionine, found throughout the element. Each of the six likely reading frames is 
30 shown, as indicated to the left of the boxes, with frames 1-3 corresponding to those 
found on the sense strand. 
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Using (he cDNA of SEQ ID N0:1 as a probe, a longer cDNA was 
obtained (SEQ ID NO:227) which contains minor nucleotide differences (less than 1%) 
compared to the genomic sequence shown in SEQ ID NO: 141 . 

^ ^ Preparation of cDNA Molecules Encodinp Other Breast T.,mor-Snecifir 

Polypeptides 

Normal RNA and tumor RNA was prepared and mRNA was isolated 
and converted into cDNA using a (dT)„AG anchored 3' primer, as described above. 
Differential display PGR was then executed using the randomly chosen primers SEQ 
1 0 ID NO.: 87- 1 25. Amplification conditions were as noted above, and bands observed to 
be specific to the RNA fingeiprint pattern of the tumor were cut out of a silver stained 
gel, subcloned into either the T-vector (Novagen, Madison, WI) or the pCRII vector 
(Invitrogen, San Diego, CA) and sequenced. The sequences are provided in SEQ ID 
N0:11 - SEQ ID NO:86. Of the 79 sequences isolated, 67 were found to be novel 
15 (SEQ ID NO.: 11-77) (see also Figures 6-20). Subsequent studies identified an 
additional 84 sequences (SEQ ID NOS: 142-226). of which 72 appeared to be novel 
(SEQ ID NOS:142, 143, 146-152. 154-166, 168-176, 178-192, 194-198, 200-204, 206, 
207, 209-214, 216, 218. 219, 221-227). To the best of the inventors' knowledge none 
of the previously identified sequences have heretofore been shown to be expressed at a 
20 greater level in human breast tumor tissue than in normal breast tissue. 

Table I shows the level of representative breast tumor-specific 
transcripts present in normal breast tissue (columns BNI -BN7). breast tumor samples 
(columns BTI-BT12) and normal prostate, kidney, liver, lung, skin, small intestine, 
stomach, myocardium, lymph node, pancreas, skeletal muscle, ovary and aorta, as 
25 determined by RT-PCR analysis. A 0-3 grading scale for message abundance is used, 
with 0 denoting no detectable message and 3 a message level comparable to the control 
message (glyceraldehyde 3-phosphate dehydrogenase). The lack of data in a given box 
indicates that the tissue has not been tested for the presence or absence of that specific 
antigen. 

30 
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Example 2 

Preparation of BlSAgl DNA from Human Genomic DNA 

This Example illustrates the preparation of BlSAgl DNA by 
5 amplification from human genomic DNA, 

BI8Agl DNA may be prepared from 250 ng human genomic DNA 
using 20 pmol of BlSAgl specific primers, 500 pmol dNTPS and 1 unit of Tag DNA 
polymerase (Perkin Elmer, Branchburg, NJ) using the following amplification 
parameters: 94°C for 30 seconds denaturing, 30 seconds 60°C to 42°C touchdown 
1 0 annealing in 2°C increments every two cycles and 72°C extension for 30 seconds. The 
last increment (a 42X annealing temperature) should cycle 25 times. Primers were 
selected using computer analysis. Primers synthesized were BlSAgM, BlSAgl -2, 
B18AgI-3, andB18Agl-4. Primer pairs that may be used are 1+3, 1+4, 2+3, and 2+4. 

Following gel electrophoresis, the band corresponding to BlSAgl DNA 
1 5 may be excised and cloned into a suitable vector. 

Example 3 

Preparation of Bl SAel DNA from Breast Tumor cDNA 

20 This Example illustrates the preparation of BlSAgl DNA by 

amplification from human breast tumor cDNA. 

First strand cDNA is synthesized from RNA prepared from human 
breast tumor tissue in a reaction mixture containing 500 ng poly A+ RNA, 200 pmol of 
the primer (T),2AG {i.e., TTT TTT TTT TTT AG) (SEQ ID NO: 130), IX first strand 

25 reverse transcriptase buffer, 6.7 mM DTT, 500 mmol dNTPs, and 1 unit AMV or 
MMLV reverse transcriptase (from any supplier, such as Gibco-BRL (Grand Island, 
NY)) in a final volume of 30 |al. After first strand synthesis, the cDNA is diluted 
approximately 25 fold and 1 |il is used for amplification as described in Example 2. 
While some primer pairs can result in a heterogeneous population of transcripts, the 

30 primers Bl 8Agl-2 (5'ATG GCT ATT TTC GGG GGC TGA CA) (SEQ ID NO: 126) 
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and Bl 8Agl-3 (5'CCG GTA TCT CCT CGT GOG TAT T) (SEQ ID NO: 127) yield a 
single 151 bp amplification product. 

Example 4 

^ Identificatio n of B-cell and T-cell Epitopes of BlSAgl 

This Example illustrates the identification of B18Agl epitopes. 
The B18Agl sequence can be screened using a variety of computer 
algorithms. To determine B-cell epitopes, the sequence can be screened for 

1 0 hydrophobicity and hydrophilicity values using the method of Hopp, Prog. Clin. Biol. 
Res. J 728:367-77 (1985) or, alternatively, Cease et al., J. Exp. Med 1779-84 
(1986) or Spouge et al., J. Immunol. 138:204-12 (1987). Additional Class II MHC 
(antibody or B-cell) epitopes can be predicted using programs such as AMPHI (e.g., 
Margalit et al., J. Immunol. 138:2213 (1987)) or the methods of Rothbard and Taylor 

15 (e.g., £A/fiOy. 7:93(1988)). 

Once peptides (15-20 amino acids long) are identified using these 
techniques, individual peptides can be synthesized using automated peptide synthesis 
equipment (available from manufacturers such as Applied Biosystems, Inc., Foster 
City, CA) and techniques such as Merrifield synthesis. Following synthesis, the 

20 peptides can used to screen sera harvested from either normal or breast cancer patiems 
to determine whether patients with breast cancer possess antibodies reactive with the 
peptides. Presence of such antibodies in breast cancer patient would confirm the 
immunogenicity of the specific B-cell epitope in question. The peptides can also be 
tested for their ability to generate a serologic or humoral immune in animals (mice, rats, 

15 rabbits, chimps etc.) following immunization in vivo. Generation of a peptide-specific 
antiserum following such immunization further confirms the immunogenicity of the 
specific B-cell epitope in question. 

To identify T-cell epitopes, the B18Agl sequence can be screened using 
different computer algorithms which are useful in identifying 8-10 amino acid motifs 

10 within the BlSAgl sequence which are capable of binding to HLA Class I MHC 
molecules, (see, e.g., Rammensee et al.. Immunogenetics 47:178-228 (1995)). 
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Following synthesis such peptides can be tested for their ability to bind to class I MHC 
using standard binding assays {e.g., Sette et al., J. Immunol. 755:5586-92 (1994)) and 
more importantly can be tested for their ability to generate antigen reactive cytotoxic T- 
cells following"^/>i vitro stimulation of patient or normal peripheral mononuclear cells 
5 using, for example, the methods of Bakker et al., Cancer Res. 55:5330-34 (1995); 
Visseren et al., J. Immunol. 75^:3991-98 (1995); Kawakami et al., .7. Immunol. 
134:3961-6^ (1995); and Kast et al., J. Immunol. 752:3904-12 (1994). Successftil 
in vitro generation of T-cells capable of killing autologous (bearing the same Class I 
MHC molecules) tumor ceils following in vitro peptide stimulation fiirther confirms the 

10 immunogenicity of the BlSAgl antigen. Furthermore, such peptides may be used to 
generate murine peptide and BlSAgl reactive cytotoxic T-cells following in vivo 
immunization in mice rendered transgenic for expression of a particular human MHC 
ClassIhaplotype(Vitielloetal.,7. Exp. Med 775:1007-15(1991). 

A representative list of predicted B18Agl B-cell and T-cell epitopes, 

15 broken down according to predicted HLA Class 1 MHC binding antigen, is shown 
below: 

Predicted Th Motifs (B-cell epitopes^ (SEP ID NOS : 131-133) 
SSGGRTFDDFHRYLLVGI 
20 QGAAQKPINLSKXIEVVQGHDE 
SPGVFLEHLQEAYRIYTPFDLSA 

Predicted HLA A2.1 Motifs (T-cell epitopes^ rSFO ID NO.S • 134-140) 

YLLVGIQGA 
25 GAAQKPINL 

NLSKXIEVV 

EVVQGHDES 

HLQEAYRIY 

NLAFVAQAA 
30 FVAQAAPDS 
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Example 5 

Characterization of Breast Tumor Genes Discovered bv Differential Display PCR 

5 The specificity and sensitivity of the breast tumor genes discovered by 

differential display PCR were determined using RT-PCR. This procedure enabled the 
rapid evaluation of breast tumor gene mRNA expression semiquantitatively without 
using large amounts of RNA. Using gene specific primers, mRNA expression levels in 
a variety of tissues were examined, including 8 breast tumors, 5 normal breasts, 2 
10 prostate tumors, 2 colon tumors, 1 lung tumor, and 14 other normal adult human 
tissues, including normal prostate, colon, kidney, liver, lung, ovary, pancreas, skeletal 
muscle, skin, stomach and testes. 

To ensure the semiquantitative nature of the RT-PCR, p-actin v^as used 
as internal control for each of the tissues examined. Serial dilutions of the first strand 

1 5 cDNAs were prepared and RT-PCR assays performed using P-actin specific primers. A 
dilution was then selected that enabled the linear range amplification of P-actin 
template, and which was sensitive enough to reflect the difference in the initial copy 
number. Using this condition, the p-actin levels were determined for each reverse 
transcription reaction from each tissue. DNA contamination was minimized by DNase 

20 treatment and by assuring a negative result when using first strand cDNA that was 
prepared without adding reverse transcriptase. 

Using gene specific primers, the mRNA expression levels were 
determined in a variety of tissues. To date 32 genes have been successfully examined 
by RT-PCR, three of which exhibit good specificity and sensitivity for breast tumors. 

25 Figures 21 A and 21 B depict the results for these three genes: B15AG-1 (SEQ ID 
NO:27), B31GAlb (SEQ ID NO: 148) and B38GA2a (SEQ ID NO. 157). 

From the foregoing, it will be appreciated that, although specific 
embodiments of the invention have been described herein for the purpose of 
illustration, various modifications may be made without deviating from the spirit and 

30 scope of the invention. 
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SEQUENCC LISTING 



(1) GFNERAL INFORMATION: 

I (1) APPLICANT: Conxa Corporation 

(11) TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR THE 
- TREATMENT AND DIAGNOSIS OF BREAST CANCER 

(Hi) NUMBER OF SEQUENCES: 227 

(1v) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE; SEED and BERRY LLP 
I (B) STREET: 6300 Columbia Center. 701 Fifth Avenue 

e CO CITY: Seattle 

(D) STATE: Washington 

(E) COUNTRY: USA 

(F) ZIP: 98104-7092 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS/MS -DOS 

(D) SOFTWARE: Patentln Release #1.0. Version #1.30 

(vi) CURRENT APPLICATION DATA: 
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(B) FILING DATE: lO-JAN-1997 
^ (C) CLASSIFICATION: 

(vrii) ATTORNEY/AGENT INFORMATION: 
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(B) REGISTRATION NUMBER: 31.39? 

(C) REFERENCE/DOCKET NUMBER: 210121. 419PC 
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(IX) TFI ECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (206) 622-4900 

(B) TELEFAX: (206) 682-6031 

I 

I (2) INFORMATION FOR SEQ ID NOl 

i 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 363 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 
■i (A) NAME/KEY: CDS 

\ (B) LOCATION: l.,363 

(XI) SEQUENCE DESCRIPTION: SEQ ID N0:1: 



TTA GAG ACC CAA TFG GGA CCT AAT TGG GAC CCA AAT TTC TCA AGT GGA 
Leu Glu Thr Gin Leu Gly Pro Asn Trp Asp Pro Asn Phe Ser Ser Gly 
' 5 10 15 



48 



GGG AGA ACT TTT GAC GAT HC CAC CGG TAT CTC CTC GTG GGT ATT CAG 
Gly Arg Thr Phe Asp Asp Phe His Arg Tyr Leu Leu Val Gly lie Gin 
20 25 30 



96 



GGA GCT GCC CAG AAA CCT ATA AAC TTG TCT AAG GCG ATT GAA GTC GTC 
Gly Ala Ala Gin Lys Pro He Asn Leu Ser Lys Ala He Glu Val Val 
35 40 45 



144 



CAG GGG CAT GAT GAG TCA CCA GGA GTG TTT TTA GAG CAC CTC CAG GAG 
Gin Gly His Asp Glu Ser Pro Gly Val Phe Leu Glu His Leu Gin Glu 
^0 55 60 



192 
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GCT TAT CGG ATT TAG ACC CCT TTT GAG CTG GCA GCC CCC GAA AAT AGC 
Ala lyr Arg He Tyr Thr Pro Phe Asp Leu Ala Ala Pro Glu Asn Scr 
65 70 75 80 



240 



CAT GCT CTT AAT TTG GCA TTT GTG GCT CAG GCA GCC CCA GAT AGT AAA 
His Ala Leu Asn Leu Ala Phe Val Ala Gin Ala Ala Pro Asp Ser Lys 
85 90 % 



288 



AGG AAA CTC CAA AAA CTA GAG 66A TTT TGC TGG AAT GAA TAC CAG TCA 
Arq Lys Leu Gin Lys Leu Glu Gly Phe Cys Trp Asn Glu Tyr Gin Ser 
100 105 110 



336 



GCT TTT AGA GAT AGC CTA AAA GGT TTT 
Ala Phe Arg Asp Ser Leu Lys Gly Phe 
115 120 



363 



(?) INFORMATION FOR SEQ ID N0:2: 

i^) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(n) MOLECULE TYPE: protein 

(XI) SEQUENCE DESCRIPTION: SEO ID N0:2: 

Leu Glu Thr Gin Leu Gly Pro Asn Trp Asp Pro Asn Phe Ser Ser Gly 
15 10 15 

Gly Arg Thr Phe Asp Asp Phe His Arg Tyr Leu Leu Val Gly lie Gin 
20 25 30 



Gly Ala Ala Gin Lys Pro He Asn Leu Ser Lys Ala He Glu Val Val 
3b 40 45 
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Gin Gly His Asp Glu Ser Pro Gly Val Phe Leu Glu His Leu Gin Glu 

55 60 

Ma Tyr Arg He Tyr Thr Pro Phe Asp Leu Ala Ala Pro Glu Asn Ser 
65 70 75 80 

His Ala Leu Asn Leu Ala Phe Val Ala Gin Ala Ala Pro Asp Ser Lys 
85 90 95 

Arg Lys Leu Gin Lys Leu Glu Gly Phe Cys Trp Asn Glu Tyr Gin Ser 
100 105 110 

Ala Phe Arg Asp Ser Leu Lys Gly Phe 
115 1?0 

(2) INFORMATION FOR SEO ID N0:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1101 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEO ID N0:3: 

TCTTAGAATC TTCATACCCC GAACTCTTGG 6AAAACTTTA ATCAGTCACC TACAGTCTAC 60 

CACCCATTTA GGAGGAGCAA AGCTACCTCA GCTCCTCCGG AGCCGTTTTA AGATCCCCCA 120 

TCTTCAAAGC CTAACAGATC AAGCAGCTCT CCGGTGCACA ACCTGCGCCC AGGTAAATGC 180 

CAAAAAAGGl CCTAAACCCA GCCCAGGCCA CCGTCTCCAA GAAAACTCAC CAGGAGAAAA 240 

GTGGGAAATT GACTTTACAG AAGTAAAACC ACACCGGGCT GGGTACAAAT ACCTTCTAG1 300 
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ACTGGTAGAC ACCTTCTCTG GATGGACTGA AGCATTTGCT ACCAAAAACG AAACTGTCAA 360 

TATGGTAGTT AAGTTTTTAC TCAATGAAAT CATCCCTCGA CGTGGGCTGC CTGTTGCCAT 420 

AGGGTCTGAT AAT6GAACGG CCTTCGCCTT GTCTATAGTT TAATCAGTCA GTAAGGCGTT 480 

AAACATTCAA TGGAAGCTCC ATTGTGCCTA TCGACCCAGA GCTCTGGGCA AGTAGAACGC 540 

ATGAACTGCA CCCTAAAAAA ACACTCTTAC AAAATTAATC TTAAAAACCG GTGTTAATTG 600 

TGTTAGTCTC CTTCCCTTAG CCCTACTTA6 AGTTAAGGTG CACCCCTTAC TGGGCTGGGT 660 

TCTTTACCTT TTGAAATCAT NTTTNGGAAG GGGCTGCCTA TCTTTNCTTA ACTAAAAAAN 720 

GCCCATTTGG CAAAAATTTC NCAACTAATT TNTACGTNCC TAC6TCTCCC CAACAGGTAN 780 

AAAAATCTNC TGCCCTTTTC AA66AACCAT CCCATCCATT CCTNAACAAA AGGCCTGCCN 840 

TTCTTCCCCC AGTTAAC7NT TTTTTNTTAA AATTCCCAAA AAAN6AACCN CCTGCTG6AA 900 

AAACNCCCCC CTCCAANCCC CGGCCNAAGN GGAAGGTTCC CTTGAATCCC NCCCCCNCNA 960 

ANGGCCCGuA ACCNTTAAAN TNGTTCCNG6 GG6TNNGGCC TAAAAGNCCN ATTTGGTA^ 1020 

CCTANAAATT TTTTCTTTTN TAAAAACCAC NNTTTNNm TTCTTAAACA AAACCCTNTl 1080 

TNTAGNANCN TATTTCCCWC C 1101 
(2) INFORMATION FOR SEO ID N0:4: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1087 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(XI) SEQUENCE DESCRIPTION: SEO ID N0:4: 
TCTAGAGCTG CGCCTGGATC CCGCCACAGT GAGGAGACCT GAAGACCAGA GAAAACACAG 60 
CAAGTAGGCC CTTTAAACTA CTCACCTGTG TTGTCTTCTA ATTTATTCTG TTTTATTTT6 120 
TTTCCATCAT TTTAAGGGGT TAAAATCATC TTGTTCAGAC CICAGCATAT AAAATGACCC 180 
ATCTGTAGAC CTCAGGCTCC AACCATACCC CAAGAGTTGT CTGGTTTTGT TTAAATTAC7 240 
GCCAGGTTTC AGCTGCAGAT ATCCCTGGAA 6GAATATTCC AGATTCCCTG AGTAGTTTCC 300 
AG6TTAAAAT CCTATAGGCT TCTTCTGTTT T6AG6AAGAG TTCCTGTCAG AGAAAAACAT 360 
GATTTTGGAT TTTTAACTIT AATGCHCTG AAACGCTATA AAAAAAATTT TCTACCCCTA 420 
GCTTTAAAGT ACTGTTAGTG AGAAATTAAA ATTCCTTCAG GAGGATTAAA CTGCCATTTC 480 

AGTTACCCTA ATTCCAAATG TTTTGGTGGT TAGAATCTTC TTTAATGTTC TTGAAGAAGT 540 

GTTTTATATT TTCCCATCNA GATAAATTCT CTCNCNCCTT NNTTTTNTNT CTNNTTTTTT 6O0 

AAAACGGANT CTTGCTCCGT TGTCCANGCT GGGAATTTTN TTTTGGCCAA TCTCCGCTNC 660 

CTTGCAANAA TNCTGCNTCC CAAAATTACC NCCTTTTTCC CACCTCCACC CCNNG6AATT 720 

ACCTGGAATT ANAG6CCCCC NCCCCCCCCC CGGCTAATTT GTTTTTGTTT TTAGTAAAAA 780 

ACGGGTTTCC TGTTTTAGIT AGGATGGCCC ANNTCTGACC CCNTNATCNT CCCCCTCNGC 840 

CCTCNAATNT TNGGNNTANG GCTTACCCCC CCCNGNNGTT TncCTCCAT TNAAATTTTC 900 

TNTGGANTCT TGAATNNCGG GTTTTCCCTT TTAAACCNAT TTTTnTTTN NNNCCCCCAM 960 
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TTTTNCCTCC CCCNTNTNTA ANGGGGGTTT CCC/V\NCCG6 GTCCNCCCCC ANGTCCCCAA 1020 

TTTTTCTCCC CCCCCCTCTT TTTTCJTTNC CCCAAAANTC CTATCTTTTC CTNNAAATAT 1080 

CNANTNT 108/ 
(?) INFORMATION FOR SLO ID N0:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1010 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SCO ID NO: 5: 

TCTAGACCAA 6AAATG6GAG GATTTTAGAG TGACTGATGA TTTCTCTATC ATCTGCAGTT 60 

AGTAAACATT CTCCACAGTT TATGCAAAAA GTAACAAAAC CACTGCAGAT GACAAACACT 120 

AGGTAACACA CATACTATCT CCCAAATACC TACCCACAAG CTCAACAATT TTAAACTGTT 180 

AGGATCACTG GCTCIAATCA CCATGACAIG AGGTCACCAC CAAACCATCA AGCGCTAAAC 240 

AGACAGAATG TTTCCACTCC TGATCCACTG TGTGGGAAGA AGCACCGAAC TTACCCACTG 300 

GGGGGCCTGC NTCANAANAA AAGCCCATGC CCCCGGGTNT NCCTTTNAAC CG6AAC6AAT 360 

NAACCCACCA TCCCCACANC TCCTCTGTTC NTGGGCCCTG CATCTTGTGG CCTCNTNINC 420 

TTTNGGGGAN ACNTG6GGAA G6TACCCCAT TTCNTTGACC CCNCNANAAA ACCCCNGTGG 480 

CCCTTTGCCC TGATTCNCNT GGGCCTTTTC TCTTTTCCCT TTTGG6TTGT TTAAATTCCC 540 

AATGTCCCCN GAACCCTCTC CNTNCTGCCC AAAACCTACC TAAATTNCTC NCTANGNNTT 600 
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rTCTTGGTGI TNCJTTTCAA AGGTNACCTT NCCTGTICAN NCCCNACNAA AATTTNTTCC 660 

NIAINNTGGN CCCNNAAAAA NNNATCNNCC CNAATTGCCC GAATTGGTTN GGTTTTTCCT 7?0 

NCTGGGGGAA ACCCTTTAAA TTTCCCCCTT GGCCGGCCCC CCTTTTTTCC CCCCTTTNGA 780 

AGGCAGGNGG TTCTTCCCGA ACTTCCAATT NCAACAGCCN TGCCCAITGN TGAAACCCTT 840 

TTCCTAAAAT TAAAAAATAN CCGGTTNNGG NNGGCCTCTT TCCCCTCCNG GNGGGNNGN6 900 

AAANTCCTTA CCCCNAAAAA GGTTGCTTAG CCCCCNGTCC CCACTCCCCC NGGAAAAATN 960 

AACCTTTTCN AAAAAAGGAA TATAANTTTN CCACTCCTTN GTTC1CTTCC 1010 
(2) INFORMATION FOR SEO ID N0:6: 

(1) SLQUENCE CHARACTERISTICS: 

(A) LENGTH: 950 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 

TCTAGAGCTC GCGGCCGCGA GCTCTAATAC GACTCACTAT AGGGCGTCGA CTCGATCTCA 60 

GCTCACTGCA ATCTCTGCCC CCGGGGTCAT GCGATTCTCC TGCCTCA6CC HCCAAGTAG 120 

CTGGGATTAC AGGCGTGCAA CACCACACCC GGCTAATTTT GTATHTTAA TAGA6ATGGG 180 

GTTTTCCCTT GTTGGCCANN ATGGTCTCNA ACCCCTGACC TCNNGTGATC CCCCCNCCCN 240 

NGANCTCNNA CTGCTGGGGA TNNCCGNNNN NNNCCTCCCN NCNCNNNNNN NCNCNNTCCN 300 
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TNNTCCTTNC TCNNNNNNNN CNNTCNNTCC NNCTTCTCNC CNNNTNTTNT CNNCNNCCNN 360 

CNNNCCNCNT NCCCNCNNNT TCNCNTNCNN TNTCCNNCNN NNTCNNCNNN CNNNNCNINN 420 

CCNNTACNTC NTNNNCNNNT CCNTCTNTNN CCTCNNCNNT CNCTNCNCNT TNTCTCCTCN 480 

NTNNNNNNCT CCNNNNNTCT CNTCNCNNCN INCCTCNNTN NCCNCNCCCC NCCTCNCNNC 540 

CTNNTTTNNN CNNCNNNTCC NTNCCNTTCN NNTCCNNTNN CNNCNTCNCN NNCNTTNTTC 600 

CCNCCNNTTC CTTNCNCNIN NNNTNTCNNN CNCNTCNNTC NTTTNCTCCT NNNTCCCNNC 660 

TCNNTTCNCC CMNNTCCNCC CCCCNCCTNT CTCTCNCCCN NNTNNNTNTN NNNCNTCCNC 720 

INTCNCNTTC NTCNNTNCNT TNCTNTCNNC NNCNNTNCNC TNCCNTNTNT CTNNNTCNCN 780 

TCNCNTNTCN CCNTCCNTTN CTNTCTCCTN TNTCCTTCCC CTCNCCTNCT CNTTCNCCNC 840 

CCNNTNTNTN TNNCNCCNNT NCTNNNCNNC CNTCNTTTCN TCTCTNCINN NNNINNCCTC 900 

NNCCCNTNCC CTNNTNCNCT NCTNNTACCN TNCTNCTCCN TCTTCCTTCC 950 
(2) INFORMATION FOR SF.Q ID NO: 7: 

(1) SLOUENCl CHARACTERISTICS: 

(A) LENGTH: 1086 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEO ID N0:7: 
TCTAGAGCTC GCGGCCGCGA GCTCAATTAA CCCTCACTAA AGGGAGTC6A CTCGATCAGA 60 
CTGTTACTGT GTCTATGTAG AAA6AAGTAG ACATAAGAGA TTCCATTTTG TTCTGTACTA 1?0 



NSOOCID: <WO_e725426A3JB> 



wo 97/25426 PCT/US97/00485 

43 

AGAAAAATTC TTCTGCCTTG AGATGCTGTT AATCTGTAAC CCTAGCCCCA ACCdGIGCT 18C 

CACAGAGACA TGTGCTGTGT TGACTCAAGG TTCAATGGAT TTAGGGCTAT GCTTTGTTAA 2/10 

AAAAGTGCTT GAAGATAATA TGCTTGTTAA AAGTCATCAC CAITCTCTAA TCTCAAGTAC 300 

CCAGGGACAC AATACACTGC GGAAGGCCCC AGGGACCTCT GTCTAGGAAA GCCAGGTATT 360 

GTCCAAGATT TCTCCCCAT6 TGATAGCCTG AGATATGGCC TCATGGGAAG GGIAAGACCT 4?0 

GACTGTCCCC CAGCCCGACA TCCCCCA6CC CGACATCCCC CAGCCCGACA CCCGAAAAGG 480 

GTCTGTGCTG AGGAAGATTA NTAAAA6AGG AAGGCTCTTT 6CATTGAAGT AAGAAGAAG6 540 

CTCTGTCTCC TGCTCGTCCC T6GGCAATAA AATGTCTTGG TGTTAAACCC GAATGTATGT 600 

TCTACTTACT GAGAATAGGA GAAAACATCC TTAGGGCTGG AGGTGAGACA CCCTGGCGGC 660 

ATACTGCTCT TTAATGCACG AGATGTTTGT NTAATTGCCA TCCAGGGCCA NCCCCTTTCC 720 

TTAACTTTTT ATGANACAAA AACTTTGHC NCTTTTCCTG CGAACCTCTC CCCCTATTAN 780 

CCTAT-^GGCC TGCCCATCCC CTCCCCAAAN GGTGAA/^/.NA FGTTCNTAAA TNCGAuuGAA 840 

TCCAAAACNT TTTCCCGTTG 6TCCCCTTTC CAACCCCGTC CCTGGGCCNN TTTCCTCCCC 900 

AACNTGTCCC GGNTCCTTCN TTCCCNCCCC CTTCCCNGAN AAAAAACCCC GTNTGANGGN 960 

GCCCCCTCAA ATTATAACCT TTCCNAAACA AANNGGTTCN AAGGTGGTTT GNTTCCGGTG 1020 

CGGCTGGCCT TGAGGTCCCC CCTNCACCCC AATTTGGAAN CCNGTTTTTT TTATTGCCCN 1080 

NTCCCC 1086 
(2) INFORMATION FOR SEO ID N0:8: 



i 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1177 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 



SEQUENCE DESCRIPTION: SLQ ID N0:8: 

NCCNTTTAGA TGTTGACAAN NTAAACAAGC N6CTCAGGCA GCTGAAAAAA GCCACTGATA 60 

AAGCATCCTG GAGTATCAGA GTTTACTGTT AGATCAGCCT CATTTGACTT CCCCTCCCAC 120 

ATGGTGTTTA AATCCAGCTA CACTACTTCC TGACTCAAAC TCCACTATTC CTGTTCATGA 180 

CTGTCAGGAA CTGTTG6AAA CTACT6AAAC TGGCCGACCT GATCTTCAAA ATGT6CCCCT ?40 

AGGAAAGGTG GATGCCACCG TGTTCACAGA CAGTACCNCC TTCCTCGAGA AGGGACTACG 300 

AGGG6CCGGT 6CANCTGTTA CCAAGGAGAC TNATGTGTTG TGGGCTCAGG CTTTACCANC 360 

AAACACCTCA NCNCNNAAGG CTGAATTGAT CGCCCTCACT CAGGCTCTCG GATGGGGTAA 420 

GGGATAVIAA CGTTAACACT GACAGCAGGT ACGCCTTTGC TACTGTGCAT GTACGTGGAG 480 

CCATCTACCA 6GAGCGTGGG CTACTCACTC GGCAGGTGGC TGTNATCCAC TGTAAAN6GA 540 

CATCAAAA6G AAAACNNGGC TGTTGCCCGT GGTAACCANA AANCTGATCN NCAGCTCNAA 600 

GATGCTGTGT TGACTTTCAC TCNCNCCTCT TAAACTTGCT GCCCACANIC TCCTTTCCCA 660 

ACCAGATCTG CCTGACAATC CCCATACTCA AAAAAAAAAN AANACTGGCC CCGAACCCNA 720 

ACCAATAAAA ACGGGGANGG TNGGINGANC NNCCTGACCC AAAAATAATG GATCCCCC6G 780 
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GCTGCAGGAA TTCAATTCAN CCTTATCNAT ACCCCCAACN NGGNGGGGGG GGCCNGTNCC 840 

CATTNCCCCT NTATTNATTC TTTNNCCCCC CCCCCGGCNT CCTTTTTNAA CTCGTGAAAG 900 

GGAAAACCTG NCTTACCAAN TTATCNCCTG GACCNTCCCC TTCCNCGGTN GNTTANAAAA 960 

AAAAGCCCNC ANTCCCNTCC NAAATTTGCA CNGAAAGGNA AGGAATTTAA CCTTTATTTT 1020 

TTNNTCCTTT ANTTTGTNNN CCCCCTTTTA CCCAGGCGAA CNGCCAICNT TTAANAAAAA 1080 

AAANAGAANG TTTATTTTTC CTTNGAACCA TCCCAATANA AANCACCCGC NGGGGAACGG 1140 

GGNGGNAGGC CNCTCACCCC CTTTNTGTNG GNG6GNC 1177 
(2) INrORMATION FOR SEQ ID N0:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1146 base pairs 

(B) TYPE: nucleic acid 

(C) S7RANDE0NESS: Single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ 10 N0:9: 

NCCNNTTNNT GATGTTCTCT TTTTGGCCTC TCTTTGGATA CTTTCCCTCT CHCAGAGGT 60 

GAAAAGGGTC AAAAGGAGCT GTTGACAGTC ATCCCAGGTG GGCCAATGTG TCCAGAGTAC 120 

AGACTCCATC AGTGAGGTCA AAGCCTGGGG CTTTTCAGA6 AAGGGAGGAT TATGGGTTH 180 

CCAATTATAC AAGTCAGAAG TAGAAAGAAG GGACATAAAC CAGGAAGGGG GTG6AGCACT 240 

CATCACCCA6 AGGGACTTGT GCCTCTCTCA GTG6TAGTAG AGGGGCTACT TCCTCCCACC 300 

ACGGTTGCAA CCAAGAGGCA ATGGGTGATG AGCCTACAGG GGACATANCC GAGGAGACAT 360 
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GGGATGACCC lAAGGGAGTA GGCTGGTTTT AAGGCGGTGG GACTGGGTGA GGGAAACTCI 420 

I CCTCTTCTTC AGAGAGAAGC AGTACAGGGC 6AGCTGAACC GGCIGAAGGT C6A6GCGAAA 480 

I ACACGGTCTG GCTCAGGAAG ACCTTGGAAG TAAAATTATG AATGGTGCAl GAAT6GAGCC 540 

I 

ATGGAAGGGG TGCTCCTGAC CAAACTCAGC CATTGATCAA TGTTAGGGAA ACTGATCAGG 600 
GAAGCCGGGA ATTTCATTAA CAACCCGCCA CACAGCTTGA ACATTGTGAG GTTCAGTGAC 660 
CCTICAAGGG GCCACTCCAC TCCAACTTTG GCCATTCIAC TTTGCNAAAT TTCCAAAACT 7?0 
TCCTTTTTTA AGGCCGAATC CNTANTCCCT NAAAAACNAA AAAAAATCTG CNCCTATTCT 780 

I GGAAAAGGCC CANCCCTTAC CAGGCTGGAA GAAATTTTNC CTTTTTHTT TTTTTGAAGG 840 

CNTTTNTTAA ATTGAACCTN AATTCNCCCC CCCAAAAAAA AACCCNCCNG GGGGGCGGAT 900 
TTCCAAAAAC NAATTCCCTT ACCAAAAAAC AAAAACCCNC CCTTNTTCCC TTCCNCCCTN 960 
TTCTTTTAAT TAGGGAGAGA TNAAGCCCCC CAATTTCCNG GNCTNGATNN GTTTCCCCCC 1020 
CCCCCATTTT CCNAAACTTT TTCCCANCNA GGAANCCNCC CTTTTTTTNG GTCNGATTNA 1080 
NCAACCTTCC AAACCATTTT TCCNNAAAAA NTTTGNTNGG NGGGAAAAAN ACCTNNTTTT im 

^ ATAGAN 

(2) INFORMATION FOR SEQ ID NO: 10: 

i 

(i) SEQUENCL CHARACTERISTICS: 

(A) LENGTH: 545 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 
3 (D) TOPOLOGY: linear 



1146 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0;10: 

CTTCATTGGG TACGGGCCCC CTCGAGGTCG ACGGTATCGA TAAGCTTGAT ATCGAATTCC 60 

I TGCAGCCCGG GGGATCCACT AGTTCTAGAG TCAGGAAGAA CCACCAACCT TCCTGATTTT 120 

lATTGGCTCT GA6TTCTGAG GCCAGTTTTC TTCTTCTGTT GAGTATGCGG GATTGTCA6G 180 

CAGATCTGGC TGTGGAAAGG AGACTGTGGG CAGCAAGTTT AGAGGCGTGA CTGAAAGTCA 240 

CACTGCATCT TGAGCTGCTG AATCAGCTTT CTGGTTACCA CGGGCAACAG CCGTGTTTTC 300 

CTTTTGATGT CCTTTACAGT GGATTACAGC CACCTGCT6A GGTGA6TAGC CCACGCTCCT 360 

GGTAGATG6C TCCACGTACA TGCACAGTAG CAAAGGCGTA CCTGCTGTCA GTGTTAACGT 420 

TAATATCCTT ACCCCATCGG AGAGCCTGAG TGAGGGCGAT CAATTCAGCC CTTTTGTGCT 480 

GAGGTGTTTG CTGGTTAAGC CCTGAACCCA CAACACATCT GTCTCCATGG TAACAGCTGC 540 

ACCGG 545 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 196 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID N0:11; 
TCTCCTAGGC TGGGCACAGT GGCTCATACC TGTAATCCTG ACCGTTTCAG AGGCTCAGGT 60 
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GGGGG6ATCG CTTGAGCCCA AGATTTCAAG ACTAGTCTGG GTAACATA6T GAGACCCTAT 120 

CTCTACGAAA AAATAAAAAA ATGAGCCIGG TGTAGTGGCA CACACCAGCT GAGGAGGGAG 180 

AATCGAGCCT AGGAGA 196 
(?) INFORMATION FOR SLQ ID NO: 12: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(x1) SEQUENCE DESCRIPTION: SEO ID N0:12: 

TCTCCTAGGC ITGGGGGCTC TGACTAGAAA TTCAAGGAAC CTGGGATTCA AGTCCAACTG 60 

TGACACCAAC TTACACTGTG GNCTCCAATA AACTGCTTCT TTCCTATTCC CTCTCTATTA 120 

AATAAAATAA GGAAAACGAT GFCTGTGTAT AGCCAAGTCA GNTATCCTAA AA6GA6ATAC 180 

lAAGTGACAT 1AAATATCAG AATGTAAAAC CTGGGAACCA GGTTCCCA6C CTGGGATTAA 240 

ACTGACAGCA AGAAGACTGA ACAGTACTAC TGTGAAAAGC CCGAAGNG6C AATATGTTCA 300 

CTCTACCGTT 6AAGGATGGC TGGGA6AATG AATGCTCTGT CCCCCAGTCC CAAGCTCACT 360 

TACTATACCT CCTTTATAGC CTAGGAGA 388 

(?) INFORMATION FOR SEQ ID NO: 13: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 337 base pairs 
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(B) TYPE: nucle-ic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY; linear 

(x1) SEOUENCE DESCRIPTION: SEQ ID N0:13: 

TAGTAGTTGC CTAIAATCAT GTTTCTCATT ATTTTCACAT TTTATTAACC AATTTCTGTT 60 

TACCCTGAAA AATATGAGGG AAATATATGA AACAGGGAGG CAATGTTCA6 ATAATTGATC 120 

ACAAGATATG ATTTCTACAT CAGAT6CTCT TTCCTTTCCT GTTTATTTCC TTTTTATTTC 180 

GGTTGTGGGG TCGAATGTAA TAGCTTTGTT TCAAGAGAGA GTTTTGGCAG TTTCTGTAGC 240 

TTCTGACACT GCTCATGTCT CCAGGCATCT ATTTGCACTT TAGGA6GTGT CGTGGGAGAC 300 

TGAGAGGTCT ATTTTTTCCA TATTTGGGCA ACTACTA 337 
(2) INFORMATION FOR SEQ ID NO: 14: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57] base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:14: 

TAGTAGTTGC CATACAGTGC CHTCCATTT ATTTAACCCC CACCTGAACG GCATAAACTG 60 

AGTGTTCAGC TGGTGTTTTT TACTGTAAAC AATAAGGAGA CTTTGCTCTT CATTTAAACC 120 

AAAATCATAT TTCATATTTT AC6CTCGAGG GTTTTTACCG 6TTCCTTTTT ACACTCCTTA 180 
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AAACAGTTTT TAAGTCGTTT GGAACAAGAT ATTTTTTCTT TCCTGGCAGC TTTTAACATT 2-10 
ATAGCAAATT TGTGTCTGG6 GGACT6CTG6 TCACTGTTTC TCACAGTTGC AAATCAAGGC 300 
ATTTGCAACC AAGAAAAAAA AATTTTTTT6 TTTTATTTGA AACT6GACCG GATAAACGGT 360 
I GTTTGGAGCG GCTGCTGTAT ATAGTTTTAA ATGGTTTATT GCACCTCCTT AAGTTGCACT 420 

^ TATGTGGGGG GGGGNTTTTG NATAGAAAGT NTTTANTCAC ANAGTCACAG GGACTTTTNT 480 

(^TTTTGGNNA CTGAGCTAAA AAGGGCTGNT TTTCGGGTGG GGGCAGATGA AGGCTCACAG 540 
GAGGCCTTTC TCTTAGAGG6 GGGAACTNCT A 57, 
(2) INFORMATION FOR SEQ 10 NO: 15: 

% 

(1) SEOUbNCE CHARACTERISTICS: 

(A) LENGTH: 548 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 
([)) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:15: 

TATATATTTA ATAACTTAAA TATATTHGA TCACCCACTG GGGTGATAAG ACAATAGATA 60 

- TAAAAGTATT TCCAAAAAGC ATAAAACCAA AGTATCATAC CAAACCAAAT TCATACTGCT 120 

_ TCCCCCACCC GCACTGAAAC HCACCTTCT AACTGTCTAC CTAACCAAAT TCTACCCTTC 180 

AAGTCTTTGG TGCGTGCTCA CTACTCTTTl TTTTTTTTTT THNTTTTGG AGATGGAGTC 240 

TGGCTGTGCA GCCCAGGGGT 6GAGTACAAT GGCACAACCT CAGCTCACTG NAACCTCCGC 300 

J CTCCCAGGTT CATGAGATTC TCCTGNHCA GCCTTCCCAG TAGCTGGGAC TACAGGTGTC 360 
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CATCACCATG CCTGGNTAAT CTTTTTTNG-^ TTTNGGGTAG AGATGGGGGT TTTACATGTT 420 
GGCCAGGNIG GTNTCGAACT CCTGACCTCA AGTGATCCAC CCACCTCAGG CTCCCAAAGT 480 
GCTAGGATTA CAGACATGAG CCACTGNGCC CAGNCCTGGT GCATGCTCAC flCTCTAGGC 540 



AACTACTA 

(2) INFORMATION FOR SEQ ID NO: 16: 

(T) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 638 base pairs 

(B) TYPE: nudeic acid 
CO STRANDEDNESS: single 
CD) TOPOLOGY: linear 



548 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

TTCCGTTATG CACATGCAGA ATATTCTATC GGTACTTCAG CTATTACTCA TTTTGATGGC 60 

GCAATCC6AG CCTATCCTCA AGATGAGTAT TTAGAAAGAA TTGATTTAGC GATAGACCAA 120 

GCTGGTAAGC ACTCTGACTA CACGAAATTG TTCAGATGTG ATGGATTTAT GACAGTTGAT ISO 

CTTTGGAAGA GATTATTAAG TGATTATTTT AAAGGGAATC CATTAATTCC AGAATATCTT 240 

GGTTTAGCIC AAGATGATAT AGAAATAGAA CAGAAAGAGA CTACAAATGA AGATGTATCA 300 

CCAACTGATA TTGAAGAGCC TATAGTAGAA AATGAATTAG CT6CATTTAT TAGCCTTACA 360 

CATAGCGATT TTCCTGATGA ATCTTATATT CAGCCATCGA CATAGCATTA CCT6ATGGGC 420 

AACCTTACGA ATAATAGAAA CTGGGTGCGG GGCTATTGAT GAATTCATCC NCAGTAAATT 480 
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TGGATATNAC AAAATATAAC TCGATTGCAT TTGGATGATG GAATACTAAA TCTGGCAAAA 540 
GTAACTTTG6 AGCTACTAGT AACCTCTCTT TTTGAGATGC AAAATTTTCT TTTAGGGTTT 600 
CTTATTCTCT ACTTTACGGA TATTGGAGCA TAACGGGA 638 
(2) INFORMATION FOR SEQ 10 NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 286 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:17: 

ACTGATGGAT 6TCGCCGGAG GCGA6GGGCC TTATCTGATG CTCGGCTGCC TGTTCGTGAT 60 

GTGCGCGGCG ATTGGGCTGT TTATCTCAAA CACCGCCACG GCGGTGCTGA TGGCGCCTAT 120 

TGCCTTAGCG GCGGCGAAGT CAATGGGC6T CTCACCCTAT CCTTTTGCCA TGGTGGTGGC 180 

6ATGGC6GCT TCGGCGGCGT TTAT6ACCCC GGTCTCCTCG CCGGTTAACA CCCTGGTCCT 240 

TGGCCCTGGC AAGTACTCAT TTAGCGATTT TGTCAAAATA GGCGTG 286 
(2) INFORMATION FOR SEQ ID NO; 18: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 262 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(XI) SF-OUFNCE DESCRIPTION: SEQ ID NQ:18; 
TCGGTCATAG CAGCCCCTTC TTCTCAATH CATCTGTCAC TACCCTGGT6 TAGTATCTCA 60 
TAGCCTTACA TTTTTATAGC CTCCTCCCTG GTCTGTCTTT T6ATTTTCCT GCCTGTAATC 120 

I 

I CATATCACAC ATAACTGCAA GTAAACATTT CTAAAGTGTG GTTATGCTCA TGTCACTCCT 180 

G7GNCAAGAA ATAGTTTCCA TTACC6TCTT AATAAAATTC GGATTTGTTC TTTNCTATTN 240 

TCACTCTTCA CCTATGACCG AA 262 

(2) INrORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
I (A) LENGTH: 261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEO ID NO 19: 
TCGGTCATAG CAAAGCCAGT GGTTTGAGCT CTCTACTGTG TAAACTCCTA AACCAAGGCC 60 
ATTTAIGATA AATGGTGGCA GGATTTTTAT TAIAAACATG TACCCATGCA AATTTCCTAT 120 
AACTCT6AGA TATATTCTTC TACATTTAAA CAATAAAAAT AATCTATTTT TAAAAGCCTA 180 
ATTTGCGTAG TTA6GTAAGA GTGTTTAATG AGAGGGTATA AGGTATAAAT CACCAGTCAA 240 

i 

CGTTTCTCTG CCTATGACCG A 261 
(2) INFORMATION FOR SEQ ID N0:20: 
(1) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: ?94 base pairs 

(B) TYPt: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(XT) SEQUENCE DESCRIPTION; SEQ 10 N0:20: 

FACAACGAGG CGACGTCGGT AAAATCGC^C AT6AAGCCAC CGCTGGTCTT TTCGTCCGAG 60 

CGATAGGCGC CGGCCAGCCA 6CGGAACGGT TGCCCGGATG GCGAAGCGAG CCGGAGTTCT 120 

TC6GACTGAG TATGAATCTT GTTGTGAAAA 1ACTCGCCGC CTTCGTTCGA CGACGTC6CG 180 

TCGAAATCTT CGANCTCCTT ACGATCGAAG TCTTCGTG6G CGACGATCGC GGTCAGTTCC 240 

GCCCCACCGA AATCATGGTT GAGCCGGATG CTGNCCCCGA AGNCCTCGTT TGTN 294 
(2) INFORMATION TOR SEQ ID N0:21: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID N0:21: 
TTGGTAAAGG GCATGGACGC AGACGCCTGA CGTTTGGCTG AAAATCTTTC ATTGATTCGT 
ATCAATGAAT AGGAAAATTC CCAAA6A6GG AATGTCCT67 TGCTCGCCAG TTTTTNTGTT 
GTTCTCAIGG ANAAGGCAAN GAGCTCTTCA GACTATTGGN ATTNTCGTTC GGTCTTCTGC 
CAACTAGTCG NCTTGCNANG ATCTTCAT 



60 
120 
180 
208 
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(2) INFORMATIOM FOR SEQ ID N0:22: 

{!) StOULNCE CHARACTERISTICS: 

(A) LENGTH: 287 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:22: 

NCCNTTGAGC TGAGTGATTG AGATNTGTAA TGGTTGTAAG GGTGATTCAG GC6GATTAGG 60 

(iTGGCGGGTC ACCCGGCAGT GGGTCTCCC6 ACAGGCCAGC AGGATTTGGG GCAGGTACGG 120 

NGTGCGCATC GCTCGACTAT ATGCTATGGC AGGCGAGCCG TGGAAGGNGG ATCAGGTCAC 180 

GGCGCTGGAG CTTTCCACGG TCCATGNATT GNGATGGCTG TTCTA6GCGG CTGTTGCCAA 240 

GCGTGATGGT ACGCTG6CTG GAGCATTGAT TTCTGGTGCC AAGGTGG 287 
(2) INFORMATION' FOR SEO ID NO:23: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 204 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:23: 



TTGGGTAAAG GGA6CAAGGA GAAGGCATGG AGAGGCTCAN GCTGGTCCTG GCCTACGACT 60 
GGGCCAAGCT GTC6CCGGGG AT6GTGGAGA ACTGAAGCGG GACCTCCTCG AGGTCCTCCG 120 
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NCGTTACTTC NCCGTCCAGG AGGAGGGTCT TTCCGTGGTC TNGGAGGAGC GGGGGGA6AA 180 
GATNCTCCTC ATGGTCNACA TCCC 204 
(2) INFORMATION FOR SEQ ID NO; 24: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 264 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 



(xi) SEQUENCE DESCRIPTION: SEO ID N0:24: 

TGGATTGGTC AGGAGCGGGT AGAGT66CAC CATTGAGGGG ATATTCAAAA ATATTATTTT 60 

GTCCTAAAT6 ATAGTTGCTG AGTTTTTCTT T6ACCCAT6A GTTATATTGG AGTTIATTTT 120 

TTAACTTTCC AATCGCATGG ACATGTTAGA CTTATTTTCT GTTAATGAH NCTATTTHA 180 

TTAAAITGGA TTT6A6AAAT TG6TTNTTAT TATATCAATT TTTGGTATTT GTTGAGTTTG 240 

ACATTATAGC TIAGTATGTG ACCA 264 
(2) INFORMATION FOR SEQ ID NO; 25: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGIH; 3/6 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 
CO) TOPOLOGY; linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID N0:25: 
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TTACAACGAG GGGAAACTCC GTCTCTACAA AAATTAAAAA ATTAGCCAGG TGTGGTGGTG bO 

TGCACCCGCA ATCCCAGCTA CTTGGGAGGT TGAGACACAA GANTCACCTA NATGTGGGAG 120 

GTCAAGGTTG CATGAGTCAT GATTGTGCCA CTGCACTCCA GCCTGGGTGA CAGACCGAGA 180 

CCCTGCCTCA ANAGANAANG AATAG6AAGT TCAGAAATCN TGGNTGTGGN GCCCAGCAAT 240 

CTGCATCTAT NCAACCCCTG CAGGCAANGC TGATGCAGCC TANGTTCAAG AGCTGCTGTT 300 

TCTGGAGGCA GCAGTTNGGG CTTCCATCCA GTATCACGGC CACACTCGCA CNAGCCATCT 360 

GTCCTCCGTN TGTNAC 376 
(2) INFORMATION FOR SEQ ID N0:26: 

(1) SLQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID N0:26: 

TTACAACGAG GGGAAACTCC GTCTCTACAA AAATTAAAAA ATTAGCCAGG TGTGGTGGTG 60 

TGCACCTGTA ATCCCAGCTA CTTG6GCGGC TGAGACACAA GAACCACCTA AAT6TGGGAG 120 

GGTCAAGGTT GCATGAGTCA TGATCGC6CC ACTGCACTCC AGCCTGGGTG ACAGACTGAG 180 

ACCCTGCCTC AAAAGAAAAA 6AATAG6AAG HCAGAAACC CTGGGTGTGG NGCCCAGCAA 240 

TCTGCATTTA AACAATCCCT GCAGGCAATG CTGATGCAGC CTAAGTTCAA GAGCTGCTGT 300 
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TCTGGAGGCA 6NAGTAA6GG CTTCCAICCA GCATCACGGN CAACACTGCA AAAGCACCTG 360 
TCCTCGTTGG TA 37^ 
(?) INKORMATION FOR StQ ID N0:27: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEO ID N0:27: 

TTCTGTCCAC ATCTACAAGT TTTATTTATT TTGTGGGTTT TCAGGGTGAC TAAGTTTTTC 60 

CCTACATTGA AAAGAGAAGT TGCTAAAAGG TGCACAGGAA ATCATTTTTT TAAGTGAATA 120 

TGATAATATG GGTCCGTGCT TAATACAACT GAGACATATT TGTTCTCTGT TTTTTTAGAG 180 

TCACCICTTA AAGTCCAATC CCACAATGGT GAAAAAAAAA TAGAAAGTAT TTGTTCTACC 240 

TTTAAGGAGA CIGCAGGGAT TCICCTTGAA AAC6GAGTAT GGAATCAATC TTAAATAAAI 300 

ATGAAATTG6 TTGGTCTTCT GGGATAAGAA ATTCCCAACT CAGTGTGCTG AAATTCACCT 360 

GACTTTTTTT GGGAAAAAAT AGTCGAAAAT GTCAATTTGG TCCATAAAAT ACATGTTACT 420 

ATTAAAAGAT ATTTAAAGAC AAATTCTHC A6AGCTCTAA GATTGGTGTG GACAGAA 477 
(2) INFORMATION FOR SEQ ID N0:28: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 438 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEONESS: single 
(0) TOPOLOGY: linear 

(xD SEQUENCE DESCRIPTION: SEQ ID N0:28: 

TCTNCAACCT CTTGANTGTC AAAAACCTTN TAGGCTATCT CTAAAAGC7G ACTGGTATTC 60 

ATTCCAGCAA AATCCCTCTA GTTTTTGGAG TTTCCTTTTA CTATCTGGGG CTGCCTGAGC 120 

CACAAATCCC AAATTAAGAG CATGGCTATT TTCGGGGGCT 6ACAG6TCAA AAGGGGTGTA 180 

AATCCGATAA GCCTCCTGGA GGTGCTCTAA AAACACICCT GGTGACTCAT CAT6CCCCT6 240 

GAC6ACTTCA ATCGNCTTAG ACAAGTTTAT AGGTTTCTG6 GCAGCTCCCT GAATACCCAC 300 

GAGGAGATAC CGGTGGAAAT CGTCAAAAGT TCTCCCTCCA CTTGAGAAAT TTGGGTCCCA 360 

ATTAGGTCCC AATTG6GTCT CTAATCACTA TTCCTCTAGC TTCCTCCTCC GGNCTATTGG 420 

TTGATGTGA6 GTTGAAGA 438 
(2) INFORMATION FOR SEQ ID N0:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(XT) SEQUENCE DESCRIPTION; SEQ ID N0:29: 
AAGAGGGTAC CAGCCCCAA6 CCTTGACAAC TTCCATAGGG TGTCAAGCCT GTGGGTGCAC 60 
AGAAGTCAAA AATTGAGTTT TGGGATCCTC AGCCTAGATT TCAGA6GATA TAAAGAAACA 120 



NSOOCID: <WO__9725426A3_IB> 



wo 97/25426 PCT/US97/00485 

60 

CCTAACACCT AGATATTCAG ACAAAAGTTT ACTACAGGGA TGAAGCTTTC ACGGAAAACC 180 

TCTACTAGGA AAGTACAGAA GAGAAATGIG GGTTTGGAGC CCCCAAACAG AATCCCCTCT ?40 

AGAACACT6C CTAATGAAAC TGTGA6AAGA TGGCCACT6T CATCCAGACA CCAGAATGAT 300 

AGACCCACCA AAAACTTATQ CCATATTGCC TATAAAACCT ACAGACACTC AATGCCA6CC 360 

CCATGAAAAA AAAACTGAGA AGAAGACTGT NCCCTACAAT GCCACCGGAG CAGAACTGCC 420 

CCAGGCCATG GAAGCACAGC TCTTATATCA ATGTGACCTG GAIGTTGAGA CATGGAATCC 480 

NANGAAATCN TTTTAANACT TCCAC6GTTN AATGACTGCC CTATTANATT CNGAACTTAN 540 

ATCCNGGCCT 6T6ACCTCTT TGCTTTGGCC ATTCCCCCTT TTTGGAATGG CTNTTTTTTT 600 

CCCATGCCTG TNCCCTCTTA 620 

(2) INFORMATION FOR SEQ ID NO: 30: 

(1) SCQUENCt CHARACTERISTICS: 
(A) LFNGTH: 100 base pairs 
(R) TYPF; nucle-ic acid 
(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEO ID NO: 30: 
TTACAACGAG GGGGTCAATG TCATAAATGT CACAATAAAA CAATCTCTTC TTTTHTTTT 60 
THTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT 100 
(2) INFORMATION FOR SEQ ID N0:31: 



i 
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(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ 10 N0:31: 

TAGTCTATGC GCCGGACAGA 6CAGAATTAA ATTGGAAGH GCCCTCCGGA CTTTCTACCC 60 

ACACTCTTCC TGAAAAGAGA AAGAAAAGAG 6CAGGAAAGA GGTTAGGATT TCATTTTCAA 120 

GAGTCAGCTA ATTAGGAGAG CAGAGTTTAG ACAGCAGTA6 GCACCCCAT6 ATACAAACCA 180 

TGGACAAAGT CCCTGTTTAG TAACTGCCAG ACATGATCCT GCTCAGGTTT TGAAATCTCT 240 

CTGCCCATAA AAGAT6GA6A GCAG6AGTGC CATCCACATC AACACGTGTC CAAGAAAGAG 300 

TCTCAG6GAG ACAAGGGTAT CAAAAAACAA GATTCTTAAT GGGAAGGAAA TCAAACCAAA 360 

AAATTAGATT TTTCTCTACA TATATATAAT ATACAGATAT TTAACACATT ATTCCAGAGG 420 

TGGCTCCA6T CCTTGGGGCT TGAGAGATGG TGAAAACTTT TGTTCCACAT TAACTTCTGC 480 

TCTCAAATTC TGAAGTATAT CAGAATGGGA CAGGCAATGT TTT6CTCCAC ACTGGGGCAC 540 

AGACCCAAAT GGTTCTGTGC CCGAAGAAGA GAAGCCCGAA AGACATGAAG GATGCTTAAG 600 

GGGGGTTGGG AAAGCCAAAT TGGTANTATC TTTTCCTCCT GCCTGTGTTC CNGAAGTCTC 660 

CNCTGAAGGA ATTCTTAAAA CCCTTTGTGA GGAAATGCCC CCTTACCATG ACAANTGGTC 720 

CCATTGCTTT TAGGGNGATG GAAACACCAA GGGTTTTGAT CC 762 
(2) INFORMATION FOR SEQ ID NO:32: 
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(1) StOUENCC CHARACTERISTICS: 

(A) LENGTH: 2/6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 32: 

1AGTCTATGC GTGTATTAAC CTCCCCTCCC TCAGTAACAA CCAAAGAGGC AGGAGCTGTT 60 

AIIACCAACC CCATTTIACA GATGCATCAA lAATGACAGA GAAGTGAAGI GACTTGCGCA 120 

CACAACCAGI AAATTGGCAG AGTCAGATH GAATCCATGG AGTCTGGTCT GCACTTTCAA 180 

TCACCGAATA CCCTTTCTAA GAAACGTGT6 C7GAATGAGT GCATGGATAA ATCAGT6TCT 240 

ACTCAACATC TITGCCTAGA TATCCCGCAT AGACTA 21f> 
(2) INFORMATION FOR SEQ ID N0:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47/ base pairs 

(B) TYPE:, nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 



(x1) SEQUENCE DESCRIPTION; SEQ ID N0:33: 

TAGTAGTTGC CAAATATTTG AAAATTTACC CAGAAGTGAT TGAAAACTTT TTGGAAACAA 60 

AAACAAATAA AGCCAAAAGG TAAAAIAAAA ATATCTHGC ACTCTCGTTA TTACCTATCC 120 

ATAACTTTTT CACCGTAAGC TCTCCTGCTT GTTAGT6TAG TGTGGTTATA TTAAACTTTT 180 
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TAr,TTATTAT THTTATTCA CTTTTCCACT AGAAAGICAT TATTGATTTA GCACACATGT ?40 

TGATCTCATT TCATTHTTC TTTTTATAGG CAAAATHGA TGCTAT6CAA CAAAAATACT 300 

CAA6CCCATT ATCTTTTTTC CCCCCGAAAT CTGAAAATTG CAGGGGACAG AGGGAAGTTA 360 

TCCCATTAAA AAATTGTAAA TATGTTCAGT TTATGTTTAA AAATGCACAA AACATAAGAA 420 

AATTGTGTTT ACTTGAGCTG CTGATTGTAA GCAGTmAT CTCAGGGGCA ACTACTA 477 
(2) FNIFORMATION FOR SEQ ID N0:34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 631 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNCSS: single 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:34: 

TAGTAGTTGC CAATTCAGAI GATCAGAAAT GCTGCTTTCC TCAGCATTGT CHGTTAAAC 60 

CGCATGCCAT TTGGAACTTT GGCAGTGAGA AGCCAAAAGG AAGAGGTGAA TGACATATAT 120 

ATATATATAT ATTCAATGAA AGTAAAATGT ATAT6CTCAT ATACTTTCTA GTTATCAGAA 180 

TGAGTTAAGC TTTATGCCAT TGGGCTGCTG CATATTTTAA TCAGAAGATA AAAGAAAATC 240 

TGGGCATTTT TAGAATGTGA TACATGTTTT TTTAAAACTG TTAAATATTA TTTCGATATT 300 

TGTCTAAGAA CCGGAAT6TT CTTAAAATTT ACTAAAACAG TATTGHTGA GGAAGAGAAA 360 

ACTGTACTGT TTGCCATTAT TACAGTCGTA CAAGTGCATG TCAAGTCACC CACTCTCTCA 420 
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GGCATCAGIA TCCACCTCAT AGCTTTACAC ATTTTGACGG GGAATATTGC AGCATCCTCA 4B0 

GGCCTGACAT CT6GGAAAG6 CTCAGATCCA CCTACTGCTC CJTGCTCGTT GATTTGTTTT 540 

AAAATATTGT GCCTGGTGTC ACTTTTAA6C CACAGCCCTG CCTAAAAGCC AGCAGAGAAC 600 

A6AACCC6CA CCATTCTATA GGCAACTACT A 631 
(2) INFORMATION FOR SEQ ID NO:35: 

(i) SEQUENCE CHARACTER I ST ICS: 

(A) LENGlll: 578 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(XT) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

TAGTAGTTGC CATCCCATAT TACAGAAGGC TCTGTAIACA TGACTTATTT GGAAGTGATC 60 

TGTTTTCTCT CCAAACCCAT TTATCGTAAT TTCACCAGTC HGGATCAAT CTTGGTTTCC 120 

ACTGATACCA TGAAACCTAC TTGGAGCAGA CATTGCACAG TTTTCTGTG6 TAAAAACTAA 180 

AGGTTTATTT GCTAAGCTGT CATCTTATGC TTA6TATTTT TTTTTTACAG TGGGGAATTG 240 

CTGAGATTAC ATTTTGTTAT TCATTAGATA CTTTGGGATA ACTTGACACT GTCTTCTTTl 300 

HTCGCTHT AATTGCTATC ATCATGCTTT TGAAACAAGA ACACATTAGT CCTCAAGTAT 360 

TACATAAGCT TGCTTGTTAC GCCTGGT66T TTAAA66ACT ATCTTT6GCC TCAGGTTCAC 420 

AAGAATGGGC AAAGTGTTTC CTTATGTTCT GTAGTTCTCA ATAAAAGATT GCCAGGGGCC 480 

GGGTACTGTG GCTCGCACTG TAATCCCAGC ACTTTGGGAA GCTGAGGCTG GCGGATCATG 540 
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TTAGGGCAGG IGTTCGAAAC CAGCCTGGGC AACTACIA 



578 



(2) INrORMATION FOR 5EQ ID NO: 36: 



i 



(1) SLQUENCL CHARACTERISTICS: 

(A) LENGTH: 583 base pairs 

(B) TYPE: nudeic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 



TAGTAGTTGC CT6TAATCCC AGCAACTCAG GA6GCTGGGG CAGGAGAATC A6TTGAACCT 60 



GGGAG6CAGA AGTTGTAATT AGCAAA6ATC GCACCATTGC ACTTCAGCCT GGGCAACAAG 120 



AGTGAGATTC CATCTCAAAA ACAAAAAAAA GAAAAAGAAA AGAAAAGGAA AAAACGTATA 180 



AACCCAGCCA AAACAAAATG ATCATTCTTT TAATAAGCAA GACTAATTTA ATGTGTTTAT 240 



TTAATCAAAG CAGTTGAATC TTCTGAGTTA TTGGTGAAAA TACCCATGTA GTTAATTTAG 300 



GGTTCTTACT T6GGTGAACG TTTGATGTTC ACAGGTTA7A AAATGGTTAA CAAGGAAAAT 360 



GATGCATAAA GAATCTTATA AACTACTAAA AATAAATAAA ATATAAATGG ATAGGTGCTA 420 



TGGATGGAGT TTTTGTGTAA TTTAAAATCT TGAAGTCATT TTGGATGCTC ATTGCnGTC 480 



TGGTAATTTC CATTAGGAAA AGGTTATGAT ATGG6GAAAC TGTTTCTGGA AATTGCGGAA 540 



TGTTTCTCAT CTGTAAAATG CTAGTATCTC AGGGCAACTA CTA 



583 



(2) INFORMATION FOR SEQ ID NO: 37: 
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(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 716 base pairs 

(B) TVPL: nucleic acid 
iC) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID N0:37: 

GATCTACTAG TCATNTGGAT TCTATCCATG GCAGCTAAGC CTTTCTGAAT GGATTCTACT 60 

GCTTTCTTGT TCTTTAATCC AGACCCTTAT ATATGTTTAI GTTCACAGGC A6GGCAATGT 120 

TTAGT6AAAA CAATTCTAAA TTTTTTATTT TGCATTTTCA TGCTAATTTC CGTCACACTC 180 

CAGCAGGCTT CCTGGGAGAA TAAGGAGAAA TACAGCTAAA GACATTGTCC CTGCTTACTT 240 

ACAGCCTAAT GGTATGCAAA ACCACTTCAA TAAAGTAACA G6AAAA6TAC TAACCAGGTA 300 

GAATGGACCA AAACTGATAT AGAAAAATCA GAGGAAGA6A GGAACAAATA TTTACT6AGT 360 

CCTAGAATGT ACAAGGCTTT TTAATTACAT ATTTTATGTA AGGCCTGCAA AAAACAGGTG 420 

AGTAATCAAC ATTTGTCCCA TTTTACATA1 AAGGAAACTG AAGCTTAAAT TGAATAATTT 480 

AATGCATAGA TTTTATAGTT A6ACCATGTT CAGGTCCCTA TGTTATACTT ACTAGCTGTA 540 

FGAATATGAG AAAATAATTT TGTTATTTTC TTGGCATCA6 TATTTTCAIC TGCAAAATAA 600 

AGCTAAAGTT ATTTAGCAAA CAGTCAGCAT AGTGCCTGAT ACATAGTAGG T6CTCCAAAC 660 

ATGATTACNC TANTATTNGG TATTANAAAA ATCCAATATA G6CNTGGATA AAACCG 716 
(2) INFORMATION FOR SEQ ID NO: 38: 
(1) SEQUENCE CHARACTERISTICS; 
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(A) LENGTH: 688 base pairs 

(B) TYPL: nucleic add 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

TTCTGTCCAC ATATCATCCC ACTTTAATTG TTAATCAGCA AAACTTTCAA TGAAAAATCA 60 

TCCATTTTAA CCAGGATCAC ACCAGGAAAC TGAAG6T6TA TTTTTTTTTA CCTTAAAAAA 1?0 

AAAAAAAAAA ACCAAACAAA CCAAAACAGA TTAACAGCAA AGAGTTCTAA AAAATTTACA 180 

TTTCTCTTAC AACTGTCATT CAGAGAACAA TAGTTCTTAA GTCTGTTAAA TCTTGGCATT 2^0 

AACAGAGAAA CTTGATGAAN AGTTGTACTT GGAATATTGT GGATTTTTTT TTTTGTCTAA 300 

TCTCCCCCTA TTGTTTTGCC AACAGTAATT TAAGTTTGTG TGGAACATCC CCGTAGTTGA 360 

AGTGTAAACA ATGTATAGGA AGGAATATAT GATAAGATGA TGCATCACAT ATGCATTACA 420 

TGTAGGGACC TTCACAACTT CATGCACTCA GAAAACATGC TTGAAGAGGA GGAGAG6ACG 480 

GCCCAGGGTC ACCATCCAGG TGCCTTGAGG ACAGAGAATG CAGAAGTGGC ACTGTTGAAA 540 

TTTAGAAGAC CATGT6TGAA TGGTTTCAGG CCTGGGATGT TTGCCACCAA GAAGTGCCTC 600 

CGAGAAATTT CTTTCCCATT TGGAATACAG GGTGGCTTGA TG6GTACGGT GGGTGACCCA 660 

ACGAAGAAAA TGAAATTCTG CCCTTTCC 688 

(2) INFORMATION FOR SEQ ID NO: 39: 

(1) SEQUENCE CHARACTERISTICS; 
CA) LENGTH: 585 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEO ID N0:39: 

TAGTAGTTGC CGCNNACCTA AAANTTGGAA AGCATGATGT CTAGGAAACA TANTAAAAIA 60 

GGGTATGCCT ATGTGCTACA 6AGAGATGTT AGCATTTAAA GTGCATANTT TTATGTATTT 120 

TGACAAAFGC ATATNCCTCT ATAATCCACA ACTGATTACG AAGCIATTAC AATTAAAAAG 180 

TTT6GCCGGG CGTGGTGGGC GGTGGCT6AC GCCT6TAATC CCAGCACTTI GG6AGGCCGA 240 

G6CAC6C6GA TCACGAGGTC GG6AGTTCAA GACCATCCTG GCTAACACGG TGAAAGTCCA 300 

TCTCTACTAA AAATACGAAA AAATTACCCC GGCGTGGTGG CGGGCGCCTG TAGTCCCAGC 360 

TACTCCGGAG GC1GAGGCAG GAGAATGGC6 TGAACCCAGG ACACGGAGCT TGCAGTGTGC 420 

CAACATCACG TCACTGCCCT CCAGCCTGGG GGACAGGAAC AAGANTCCCG TCCTCANAAA 480 

AGAAAAATAC "lACTNATANT TTCNACTTTA TTTTAANTTA CACAGAACTN CCTCTTGGTA 540 

CCCCCTTACC ATTCATCTCA CCCACCTCCT ATAGGGCACN NCTAA 585 
(2) INFORMATION FOR SEQ ID NI0:40: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 475 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CO) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:40: 

ICTGTCCACA CCAATCTTAG AAGCTCTGAA AAGAATTTGT CTTTAAATAT CTTTTAATAG 60 

lAACATGTAT TTTATGGACC AAATTGACAT TTTC6ACTGT TTTTTCCAAA AAAGTCAGGT 120 

GAATTTCAGC ACACTGAGTT GGGAATTTCT TATCCCAGAA GACCAACCAA TTTCATATTT 180 

ATTTAAGATT GATTCCATAC TCCGTTTTCA AGGAGAAfCC CTGCAGTCTC CTTAAAGGTA 240 

GAACAAAIAC TTCCTATTTT TTTTTCACCA TTGTGGGATT GGACTTTAAG AGGTGACTCT 300 

AAAAAAACAG AGAACAAATA TGTCTCAGTT GTATTAAGCA CGGACCCATA TTATCATATT 360 

CACTTAAAAA AATGATTTCC TGT6CACCTT TTGGCAACTT CTCTTTTCAA T6TAGGGAAA 420 

AACTTAGTCA CCCTGAAAAC CCACAAAATA AATAAAACTT GTAGATGTGG ACAGA 475 
(2) INFORMATION FOR SEQ ID N0:41: 

(1) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 423 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEO ID NO: 41: 

TAAGAGGGTA CATCGGGTAA GAACGTAGGC ACATCTAGAG CTTAGAGAAG TCTGGGGTAG 60 

GAAAAAAATC TAAGTATTTA TAA6GGTATA GGTAACATTT AAAAGTAGGG CTAGCTGACA 120 

TTATTTAGAA AGAACACATA CGGAGAGATA AGGGCAAAGG ACTAAGACCA GAGGAACACT 180 

AATATTTAGT GATCACTTCC ATTCTTGGTA AAAATAGTAA CTTTTAAGTT AGCTTCAAGG 240 
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AAGATTTTTG GCCATGATTA GTTGTCAAAA 6TTAGTTCTC TTGGGTTTAT ATTACTAATT 300 
TTGTTTTAAG ATCCTTGTTA GTGCTTTAAT AAAGTCATGT TATATCAAAC GCTCTAAAAC 360 
ATTGTAGCAT GTTAAATGTC ACAATATACT TACCATTTGT TGTATATGGC TGTACCCTCT 420 
CTA 

(2) INFORMATION FOR SEO lU N0:42; 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 527 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 
(0) TOPOLOGY: linear 

Cxi) SEQUENCE DESCRIPTION: 5EQ ID N0:42: 

TC ICCTAGGC lAATGTGTGT GTTTCTGTAA AAGIAAAAAG TTAAAAATTT TAAAAATAGA 60 

AAAAAGCTTA TAGAATAAGA ATATGAAGAA AGAAAATATT TTTGTACATT TGCACAATGA 120 

GTTTATGTTT TAAGCTAA6T GTTATTACAA AAGAGCCAAA AAGGTTTTAA AAATTAAAAC 180 

GTTTGTAAAG TTACAGFACC CTTATGTTAA TTTATAATTG AAGAAAGAAA AACTTTTTTT 240 

TATAAATGTA GTGTAGCCTA AGCATACAGT ATTTATAAAG TCTGGCAGTG TTCAATAATG 300 

TCCTAGGCCT fCACATTCAC TCACTGACTC ACCCAGA6CA ACTTCCAGTC CTGTAAGCTC 360 

CATTCGTGGf AAGTGCCCTA TACAGGTGCA CCATTTATIT TACAGTATTT TTACTGTACC 420 

TTCTCTATGT TTCCATATGT TTCGATATAC AAATACCACT GGTTACTATN GCCCNACAGG 480 
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TAATTCCAGl AACACGGCCT GTAIACGTCT GGTANCCCTA GNGAAGA 5?7 
(2) INFORMATION FOR SEQ ID N0:43: 

(1) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 331 base pairs 

(B) lYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY; linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID N0:43: 

TCTTCAACCT CGTAGGACAA CTCTCATATG CCTGGGCACT ATTTTTAGGT TACTACCTTG 60 

GCTGCCCTTC TTTAAGAAAA AAAAAAGAAG AAAAAAGAAC TTTTCCACAA GTHCTCTTC 120 

CTCTAGTTGG AAAATTAGAG AAATCATGTT TTTAATTTTG TGHATHCA 6ATCACAAAT 180 

TCAAACACTT GTAAACATTA AGCTTCTGTT CAATCCCCTG G6AAGAGGAT TCATTCTGAT 240 

ATTTACGGTT CAAAA6AAGT TGTAATATTG TGCTTGGAAC ACAGA6AACC AGTTATTAAC 300 

TICCTACTAC TATTATATAA TAAATAATAA C 331 

(?) INFORMATION FOR SEQ ID N0:44: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 592 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID N0:44: 
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GGCTTAGTAG TTGCCAGGCA AAATARCGTT GATTCTCCTC AGGAGCCACC CCCAACACCC 60 

CTGTTTGCTT CTAGACC1AT ACCTAGACTA AAGICCCAGC AGACCCCTAG AGGTGAGGT1 120 

CAGAGTGACC CTTGAGGAGA TGTGCTACAC TAGAAAAGAA CTGCTT6AGT TTTCTAATTT 180 

AlAIAAGCAG AAATCTGGAG AAGAGTCATA G6AATGGATA TTAAGGGTGT GAGATAATGG 240 

CGGAAGGAAT ATAGAGTT6G ATCAGGCTGG ACTTATTGAT TTGAACCCAC TAAGTAGAGA 300 

TTCTGCTTTT GATGTTGCAG CICAGGGAGT TAAAAAAGGT TTTAATGGTT CTAATAGTTT 360 

ATTTGCTTGG TTAGCTGAAA TATGGATAAA AGATGGCCCA CTGTGAGCAA GCTGGAAATG 420 

CCTGATCTCT CTCAGTTTAA TGTAGAGGAA G6GATCCAAA AGTTTAGGGA GANTTGGAT6 480 

CTGGRAKTGG ATTGGTCACT TTGRGACCIA CCCWTCCCAG C IG6GAGGGT CCAGAAGATA 540 

CACCCTTGAC CAACGCTTTG CGAAATGGAT TTGTGATGGC GGCAACIACT AA 592 
(2) INFORMATION FOR SEQ ID N0:4b: 

(1) SLQUtNCE CHARACTERISTICS: 

(A) LENGTH: 567 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO:45: 

GGCTTAGTAG TTGCCATTGC GAGTGCTTGC TCAACGAGCG TTGAACATGG CGGATTGTCT 60 

AGATICAACG GATTTGAGTT TTACCAGCAA AGCGAACCAA GCGCGGCCCA GAGAATTATG 120 

GGTTGGTTGG CTTTGAAAAG ATGGAAATCC TGTAGGCCTA GTCAGAAAAG CCTTCTTGCA 180 
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GAACAGTTCG TTCTCGGGCG AACGCTCATC AAGATGCCCA TT6GAAAGGC TAGCGTGTAT 240 

TTGGGAGAGC CTGATAGCGT GTCTTCTGAT GATGHTGIG CTTGGACAGT GACAAAAGAT 300 

ATGCAAAGCA AGTCCGAACT AGACGTCAAG CTTCGTGA6C AAATTATT6T A6ACTCCTAC 360 

TTATACTGT6 AGGAATGATA GCCAAGG6TG GG6ACTTTAA GACTAAGGTG GTTTGIACTI 4?0 

GCGCCGATGA TCCCAGGCAG AAAGAMCTGA TCGCTAGTTT TATACGGGCA ACTACTAAGC 480 

C6AA1TCCAG CACACTGGCG GCCGTTACTA ATTGGATCCG ANCTCGGTAC CAGCTTGATG 540 

CATASCTTGA GTTWTCTATA NTGTCNC 567 
(2) INFORMATION FOR SEQ 10 NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 908 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: 5E0 ID N0:46: 

GAGCGAAAGA CCGAGGGCAG NGNNTANGNG CGANGAAGCG GAGAGGGCCA AAAAGCAACC 60 

GCTTTCCCCG GGGGGTGCCG ATTCATTAAG GCAGGTGGAG GACAGGTTTC CCGATGGAAG 120 

GC6GCAGGGG CGCAAGCAAT TAATGT6AGT AGGCCATTCA TTAGCACCCG GGCTTAACAT 180 

TTAAGCTTCG GGTTGGTATG TGGTGGGAAT TGTGAGCGGA TAACAATTTC ACACAGGAAA 240 

CAGCTATGAC CATGATTACG CCAAGCTATT TAGGTGACAT TATAGAATAA CTCAAGTTAT 300 



INSDOCID: <W0 972542eA3JB> 



wo 97/25426 PCT/US97/00485 

74 

GCATCAAGCT TGGTACCGAG TTC6GATCCA CTAGTAACGG CCGCCAGTGT GTGGAATTCG 360 

GCTTAGTAGT TGCCGACCAT GGAGTGCTAC CTAGGCIAGA ATACCTGAGY TCCTCCCTAG m 

CCTCACTCAC ATTAAATTGl ATCTTTTCTA CATTAGATGT CCTCAGC6CC TTATTTCTGC 480 

TGGACWATCG ATAAAHAAT CCTGATAGGA TGATAGCAGC AGATTAATTA CTGAGAGTAT 540 

GTTAATGTGl CATCCCTCCT AUTAACGTA TTTGCATTTT AATGGAGCAA TTCTGGAGAT 600 

AATCCCTGAA GGCAAAGGAA TGAATCTTGA GGGTGAGAAA GCCAGAATCA GTGTCCAGCT 660 

GCAGTTGTGG GAGAAGGTGA TATTATGTAT GTCTCAGAAG TGACACCATA TGG6CAACTA 720 

CTAAGCCCGA ATTCCA6CAC ACTGGCGGGC GTTACTAATG GATCCGACCT CGGTACCAAG 780 

CTTGATGCAT AGCTTGAGTA TCTATAGTGT CACTAAATA6 CCTGGCGTTA TCATGGTCAl 840 

AGCTGTTTCC TGTGTGAAAT TGTTATCCGC TCCCAATTCC CCCCACCATA CGAGCCGGAA 900 

CATAAAGT 908 
(?) INFORMATION f-OR SEO ID N0:47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SFQ ID N0:47: 

TGCCAACAAG GAAAGTTTTA AATTTCCCCT TGAGGATTCT TGGTGATCAT CAAATTCAGl 60 

GGTTTTTAAG GTTGTTTTCT GTCAAATAAC TCTAACTTTA AGCCAAACAG TATATGGAAG 1?0 
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CACAC^ATAKA ATATTACACA GA7AAAAGAG GAGTTGATCT AAAGTARAGA TA6TTGGGGG 180 

CTTTAATTTC TGGAACCTAG GTCTCCCCAT CTTCTTCTGT GCTGAGGAAC TTCTTGGAAG ?40 

CGGGGATTCT AAAGTTCTTT GGAAGACAGT TTGAAAACCA CCATGTTGTT CTCAGTACCl 300 

TTATTTTTAA AAAGTAGGTG AACATTTTGA GAGAGAAAAG 6GCTTGGTTG AGATGAAGTC 360 

CCCCCCCCCC CTTTHTTTT TTTTAGCTGA AATA6ATACC CTATGT[NAA RGAARGGATT 420 

ATTATTTACC ATGCCAYTAR SCACATGCTC TTTGATGGGC NYCTCCSTAC CCTCCTTAAG 480 

(2) INrORMATION FOR SEQ ID N0:48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 591 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID N0:48: 

AAGAGGGTAC CGAGTGGAAT TTCCGCTTCA CTAGTCT6GT GTGGCTAGTC GGTTTCGTGG 60 

TGGCCAACAT TACGAACTTC CAACTCAACC GTTCTT6GAC GTTCAAGCGG GAGTACCGGC 120 

GAGGATGGTG GCGTGAATTC TGGCCTTTCT TTGCCGTGGG ATCGGTAGCC GCCATCATCG 180 

GTATGTTTAT CAAGATCTTC TTTACTAACC CGACCTCTCC GATTTACCTG CCC6AGCCGT 240 

GGTTTAACGA GG6GAGGGGG ATCCAGTCAC GCGAGTACTG GTCCCAGATC TTCGCCATCG 300 

TCGTGACAAT GCCTATCAAC TTCGTCGTCA ATAAGTTGTG GACCTTCCGA ACGGT6AAGC 360 
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ACTCCGAAAA CGTCCGGTGG CTGCTGTGCG GTGACTCCCA AAATCTTGAT AACAACAAGG 420 

TAACCGAATC GCGCTAAGGA ACCCCGGCAT CTCGGGTACT CTGCATATGC GTACCCCTTA 480 

I AGCCGAATTC CAGCACACTG GCGGCCGTTA CTAATTGGAT CCGAACTCCG TAACCAAGCC 540 

TGATGCGIAA CTTGAGTTAT TCTATAGTGT CCCIAAAAIA ACCT6GCGTT A 591 

(2) INFORMATION FOR SFQ ID NO: 49: 

(1) SEQUENCt CHARACTERISTICS: 
(A) LENGTH: 454 base pairs 
CB) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
I (D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID N0:49: 

AAGAGGGTAC CTGCCTTGAA ATTTAAATGT CTAAGGAAAR TGG6AGATGA TTAAGAGTTG 60 

GTGTGGCYTA GTCACACCAA AATGIATTIA TTACATCCTG CTCCTTTCIA GTTGACAGGA 120 

AAGAAAGCTG CTGTGGGGAA AGGAGGGATA AATACTGAAG GGATTTACIA AACAAATGTC 180 

CAFCACAGAG TTTTCCTTTT HTTTTTTTG AGACAGAGTC TTGCTCTGTC ACCCAGGCTG 240 

GAATGAAGWG GTATGATCTC AGTTGAATGC AACCTCTACC TCCTAGGTTC AAGCGAHCT 300 

i CATGCCTCAG CCTCCTGAGC AGCTGGGACT ATAGGCGCAT GCTACCATGC CAGGCTAATl 360 

TTIATATTTT TATTAGAGAC GGGGTGTTGC CATGTTGGCC AGGCAGGTCT CGAACTCCTG 420 

GGCCTCAGAT GATCTGCCCC ACCGTACCCT CTIA 454 
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(2) INFORMATION FOR SEQ ID NO: 50: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 463 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNtSS: single 
CD) TOPOLOGY: "hnear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:50: 

AAGAGGGTAC CAAAAAAAA6 AAAAAGGAAA AAAAGAAAAA CAACTTGTAl AAGGCTTTCT 60 

GC7GCATACA GCTTTTTTTT TTTAAATAAA TGGTGCCAAC AAATGTTTTT GCATTCACAC 120 

CAATTGCT6G THTGAAATC GTACTCTTCA AAGGTATTTG TGCAGATCAA TCCAATAGTG 180 

ATGCCCCGTA GGTTTTGTGG ACTGCCCACG TTGTCTACCT TCTCATGTAG GAGCCATTGA 240 

GAGACTGTTT GGACATGCCT GTGTTCATGT AGCCGT6ATG TCCGGGGGCC GTGTACATCA 300 

TGTTACCGTG GGGTGGGGTC TGCATTGGCT GCTGGGCATA TGGCTGGGTG CCCATCATGC 360 

CCATCTGCAT CTGCATAGGG TATT6GGGCG TTTGATCCAT ATA6CCATGA TTGCTGlGGi 420 

AGCCACTGTT CATCATTGGC TGGGACATGC TGTTACCCTC TTA 463 
(2) INFORMATION FOR SEO ID N0:51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 399 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



I 
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(XI) SEQUENCE DESCRIPTION: SEQ ID NO:bl: 
CTTCAACCTC CCAAAGTGCT GGGATTACAG GACTGAGCCA CCAC6CTCAG CCTAAGCCTC 60 

TTTTTCACTA CCCTCTAAGC GATCTACCAC AGTGAT6AGG GGCTAAAGAG CAGTGCAATT 120 

TGATTACAAT AATG6AACTT AGATTTAIIA ATTAACAATT TTTCCTTA6C ATGTTGGnC 180 

CATAATTATT AAGAGTATGG ACTTACTTAG AAATGAGCTT TCATTTTAAG AATTTCATCT 240 

TTGACCTTCT CTATTAGTCT GA6CAGTATG ACACTATACG TATTTTAITT AACTAACCTA 300 

CCFTGAGCTA TTACTTTTTA AAAGGCTATA TACATGAATG TGTATT6TCA ACTGTAAAGC 360 

CCCACAGTAT TTAATTATAT CATGATGTCT TTGAGGTTG 399 
(?) INFORMATION FOR SEO ID NO: 52: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGIH: 392 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID N0:52: 

CTTCAACCTC AATCAACCTT GGTAATT6AT AAAATCATCA CTTAACTTTC TGATATAATG 60 

GCAATAAKA TCTGAGAAAA AAAAGTGGTG AAAGATTAAA CTTGCATTTC TCTCAGAATC 120 

TTGAAGGATA TTTGAATAAT TCAAAAGCGG AATCAGTAGT ATCAGCCGAA 6AAACTCACT 180 

TAGCTAGAAC GTTGGACCCA TGGATCTAAG TCCCTGCCCT TCCACTAACC AGCTGATTGG 240 

nTTGTGlAA ACCTCCTACA CGCTTGGGCT TG6TCGCCTC ATTTGTCAAA GTAAAGGCTG 300 
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AAATAGGAAG ATAATGAACC GTGTCTTTTT GGTCTCTTTT CCATCCATTA CTCTGATTTT 360 
ACAAAGAG6C CTGTATTCCC CTGGTGAGGT TG 392 
{?.) INFORMATION FOR SEQ ID NO: 53; 

i^) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 179 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID N0:53: 

I 

TTC6GGTGAT GCCTCCTCAG GCTACAGTGA AGACTG6ATT ACAGAAAGGT 6CCAGCGAGA 60 
TTTCAGATTC CTGTAAACCT CTAAAGAAAA GGAGTCGCGC CTCAACTGAT GTAGAAATGA 120 
CTAGTTCAGC ATACNGAGAC ACNTCTGACT CCGATTCTAG AG6ACTGAGT 6ACCTGCAN 179 
(?) INFORMATION FOR SEQ ID NO:54: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID N0:54: 
TTCGGGTGAT GCCTCCTCAG GCTACATCAT NATAGAAGCA AAGTAGAANA ATCNNGTTTG 60 
TGCATTTTCC CACANACAAA ATTCAAAT6A NTGGAAGAAA TTGGGANAGT AT 112 
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(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 225 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNCSS: single 

(D) TOPOLOGY; linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:55: 
TGAGCTTCC6 CTTCTGACAA CTCAATAGAT AATCAAAGGA CAACTTTAAC AGGGAHCAC 
AAAGGAGTAT ATCCAAATGC CAATAAACAT ATAAAAAGGA ATTCAGCTIC ATCATCATCA 
GAAGWATGCA AATTAAAACC ATAATGA6AA ACCACTATGT CCCACTAGAA TAGATAAAAT 
CTTAAAAGAC TGGTAAAACC AAGTGTTGGT AAGGCAAGAG GAGCA 
(2) INFORMATION FOR SEQ ID N0:56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 175 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
GCTCCTCTTG CCTTACCAAC ACATTCTCAA AAACCTGTTA GAGTCCTAAG CATTCTCCTG 
TTAGTATTGG GATTTTACCC CTGTCCTATA AAGATGTTAT GTACCAAAAA TGAAGTGGA6 
GGCCATACCC TGAGGGAGGG GAGGGATCTC TAGTGTTGTC AGAAGCGGAA GCTCA 
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(2) INFORMATION FOR SCO ID NO: 57: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 223 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO; 57: 

AGCCATTTAC CACCCATGGA TGAATGGATT TTGTAATTCT AGCTGTTGTA TTTT6TGAAT 60 

TTGTTAATTT TGTTGTTTTT CTGTGAAACA CATACATTGG ATATGGGAGG TAAA6GAGTG 120 

TCCCAGTTGC TCCTGGTCAC TCCCTTTATA GCCATTACTG TCTTGTTTCT TGTAACTCAG J80 

GTTAGGTTTT GGTCTCTCTT GCTCCACTGC AAAAAAAAAA AAA 223 
(?) INFORMATION FOR SEQ ID N0:58:' 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

{xi; SEQUENCE DESCRIPTION: SEQ ID N0:58: 

GTTCGAAGCT GAACGTGTA6 GTAGCGGATC TCACAACTGG GGAACTGTCA AAGACGAATT 60 

AACTGACTTG GATCAATCAA ATGT6ACTGA GGAAACACCT GAAGGTGAAG AACATCATCC 120 

AGTGGCAGAC ACT6AAAATA AGGAGAATGA AGTTGAAGAG GTAAAAGAGG AGGGTCCAAA 180 
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AGAGATGACT TFGGATGGGT GGTAAATG6C T 
(2) INFORMATION FOR SEO ID N0:59: 

(i) SEOUtNCE CHARACTERISTICS; 

(A) LENGTH: 208 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



211 



(XI) SEQUENCE DESCRIPTION: SEQ ID N0:59; 

GCTCCTCTTG CCTTACCAAC TTTGCACCCA TCATCAACCA TGTGGCCA6G TTTGCAGCCC 60 

AGGCTGCACA TCAGGGGACT GCCTCGCAAT ACTTCATGCT GTTGCTGCTG ACTGATGGTG 120 

CTGIGACGGA TGTGGAAGCC ACACGTGAGG CTGTGGTGCG T6CCTCGAAC CTGCCCATGT 180 

CAGT6ATCAT TATGGGTGGT AAATGGCT 208 
(2) INFORMATION FOR SEO ID NO:60: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEO ID N0:60: 
AGCCATTTAC CACCCATACT AAATTCTAGT ICAAACTCCA ACTTCTTCCA TAAAACATCT 60 
AACCACTGAC ACCAGTTGGC AATAGCTTCT TCCTTCHTA ACCTCTTAGA GTATTTAT6G 120 
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TCAATGCCAC ACATTTCTGC AACT6AATAA AGTTGGTAA6 GCAAGAGGAG C 171 
(2) INFORMATION FOR SEC ID N0:6]: 

(1) SFQUENCE CHARACTERISTICS; 
CA) LENGTH: 134 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEONESS; single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
CGGGTGATGC CTCCTCAGGC TTTGGTGTGT CCACTCNAC1 CACTGGCCTC TTCTCCAGCA 60 
ACTGGTGAAN ATGTCCTCAN GAAAANCNCC ACACGCNGCT CAGGGTGGG6 TGGGAANCAl 120 
CANAATCATC NGGC 1 34 

(?) INFORMATION FOR SEQ ID NO; 62: 

(1) SEQUENCE CfiARACTERISTICS: 

(A) LENGTH: 145 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID N0:62: 

AGAGGGTACA TATGCAACAG TATATAAAGG AAGAAGTGCA CTGAGAGGAA CTTCATCAAG 60 

GCCATTTAAT CAATAAGTGA TAGAGTCAAG 6CTCAACCCA 6GTGTGACGG ATTCCAGGTC 120 

CCAAGCTCCT TACT6GTACC CTCTT 1^5 
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(2) INFORMATION FOR SFQ ID NO: 63: 

{^) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 297 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEO ID N0:63: 

TGCACT6AGA GGAATTCAAA GGGTTTAT6C CAAAGAACAA ACCAGTCCTC TGCAGCCTAA 60 

CTCATTTGTT TTTGGGCTGC GAAGCCATGT AGAG6GCGAT CAGGCAGTAG ATGGTCCCTC 120 

CCACAGTCAG CGCCATGGTG GTCCGGTAAA GCATTTGGTC AGGCAGGCCl CGTTTCAGGT 180 

A6ACGGGCAC ACATCAGCTT TCTGGAAAAA CTTTTGTAGC TCTGGAGCTT TGTTTTTCCC 240 

AGCATAATCA TACACTGTGG AATC6GA6GT " CAGTTTA6TT GGTAAGGCAA 6AG6AGC 297 

(2) INFORMATION hOR SEQ ID NO: 64: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 300 base pairs 
(R) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEO 10 N0:64: 
GCACTGAGAG 6AACTTCCAA TACTATGTTG AATAG6AGTG GTGAGA6AGG GCATCCTTCT 60 
CTTGTGCCGG TTTTCAAAGG GAATGCTTCC AGCTTTTGCC CATTCAGTAT AATATTAAAG 120 
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MTGTTTTAC CATTTTCTGT CTTGCCTGTT TTTCTGTGTT TTTGTTGGTC TCITCATTCT 



180 



I CCATTTTTAG GCCTTTACAT GTTAGGAATA TATTTCTTTT AATGA7ACTT CACCTTTGGI 240 

I ATCTTTTGTG AQACTCTACT CATAGTGTGA TAAGCACTGG GTTGGTAAGG CAAGAGGAGC 300 



(2) INFORMATION FOR SEO ID N0:65: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 203 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPaOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:55: 

GCTCCTCTTG CCHACCAAC TCACCCAGTA TGTCAGCAAT THATCRGCT TTACCTAC6A 60 

AACAGCCTGT ATCCAAACAC TTAACACACT CACCTGAAAA GTTCAGGCAA CAATCGCCTT 120 

CTCATGGGTC TCTCTGCTCC AGTTCTGAAC CTTTCTCTTT TCCTAGAACA T6CATTTARG 180 

TC6ATAGAAG TTCCTCTCAG TGC 203 
(2) INFORMATION FOR SEQ ID N0:66: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 344 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(XT) SEQUFNCt DESCRIPTION: SLQ ID N0:66; 

TACGGGGACC CCTGCATT6A 6AAAGCGAGA CTCACTCTGA AGCTGAAATG CTGTTGCCCT 60 

TGCAGTGCTG GTAGCAGGAG TTCTGTGCTT TGTGGGCiAA GGCTCCTG6A TGACCCCTGA 120 

CATGGAGAAG GCAGAGTTGT GTGCCCCTTC TCATGGCCTC GTCAAGGCAT CATGGACTGC 180 

CACACACAAA AT6CCGTTTT TATTAACGAC AT6AAATTGA A6GAGAGAAC ACAATTCACT 240 

6ATGTGGCTC GTAACCATGG ATATGGTCAC ATACAGAGGT GTGATTATGT AAAGGTTAAT 300 

TCCACCCACC TCATGTGGAA ACTAGCCTCA ATGCAGGGGT CCCA 34.1 
(2) INFORMATION FOR SEO ID NO: 67; 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO:67: 
GCACTGAGAG GAACTTCGTA GGGAGGTTGA ACTGGCTGCT GA6GAGGGGG AACAACAGGG 60 
TAACCAGACT GATAGCCATl GGATGGATAA TATGGTGGTT GA6GAGG6AC ACTACTTATA 120 
GCAGAGGGTT GT6TATAGCC TGAGGAGGCA TCACCCG 157 
(2) INEORMATION FOR SEO ID N0;68: 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDFDNESS: single 

(D) TOPOLOGY: linear 



(XI) StOUENCE DESCRIPTION: SEQ ID N0:68: 
GCACTGAGA6 GAACTTCTA6 AAAGTGAAAG TCTAGACATA AAATAAAATA AAAATTTAAA 60 
ACTCAGGAGA 6ACAGCCCAG CACGGTGGCT CACGCCTGTA ATCCCAGAAC TTTGG6AGCC 120 
T6AGGAGGCA TCACCC6 
(2) INFORMATION FOR SEO ID NO: 69: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID N0:69: 
CGGGTGATGC CTCCTCAGGC TGTATTTTGA AGACTATCGA CTGGACTTCT TATCAACTGA 60 
AGAATCCGTT AAAAATACCA GTTGTATTAT TTCTACCTGT CAAAATCCAT TTCAAATGTT 120 
GAAGTTCCTC TCAGTGC 137 
(2) INFORMATION FOR SEQ ID NO: 70: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 220 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 
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Cxi) SEQUENCE DESCRIPTION: SEQ ID NO; 70: 

AGCATGTTGA GCCCAGACAC GCAATCTGAA TGAGT6TGCA CCTCAAGTAA ATGTCTACAC 60 

GCTGCCTGGT CTGACATGGC ACACCATCNC GTGGAGGGCA CASCTCTGCT CNGCCTACWA 120 

CGAGG6CANT CTCATWGACA GGTTCCACCC ACCAAACTGC AAGAGGCTCA NNAAGTACTR 180 

CCAGGGTMYA SG6ACMASGG TGGGAYTYCA YCACWCATCT 220 
(2) INFORMATION FOR SEQ ID N0:71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 353 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

CGTTAGGGTC TCTATCCACT GCTAAACCAT ACACCTG6GT AAACAGGGAC CATTTAACAT 60 

TCCCANCTAA ATAT6CCAAG TGACTICACA TGTTTATCTT AAAGATGTCC AAAACGCAAC 120 

T6ATTTTCTC CCCTAAACCT GTGATGGTGG GATGATTAAN CCTGAGTGGT CTACAGCAAG 180 

TIAAGTGCAA GGTGCJAAAT GAANGTGACC TGAGATACAG CATCTACAAG GCAGTACCTC 240 

TCAACNCAGG GCAACTTTGC TTCTCANAGG GCATTTAGCA GTGTCTGAAG TAATHCTGT 300 

ATTACAACTC ACGGGGCGGG GGGTGAATAT CTANTGGANA GNAGACCCTA ACG 3b3 
(2) INFORMATION FOR SEQ ID NO:72: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 343 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION; SEQ ID N0:72: 

GCACTGAGAG GAACTTCCAA TACYATKATC AGAGTGAACA RGCARCCYAC AGAACAGGAG 60 

AAAATGTTYG CAATCTCTCC ATCTGACAAA AGGCTAATAT CCAGAWTCTA AWA6GAACTT 120 

AAACAAATTT ATGAGAAAA6 AACARACAAC CTCAWCAAAA AGTGGGTGAA GGAWATGCTS 180 

AAARGAAGAC ATYTATTCAG CCAGTAAACA YATGAAAAAA AGGCTCATSA TCACTGAWCA 240 

TTAGAGAAAT GCAAATCAAA ACCACAATGA GATACCATCT YAYRCCAGTT AGAAYGGTGA 300 

TCATTAAAAR STCAGGAAAC AACAGATGCT GGACAAGGTG TCA 343 
{?) INFORMATION FOR SEO ID NO:73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEO ID NO:73: 
GCACTGAGAG GAACTTCAGA GA6AGAGAGA GAGTTCCACC CTGTACTTGG GGAGAGAAAC 60 
AGAAGGTGAG AAAGTCTTTG 6TTCT6AAGC AGCTTCTAAG ATCTTTTCAT TTGCTTCATT 120 
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TCAAAGTTCC CATGCTGCCA AAGTGCCATC CTTTGGGGTA CTGTTTTCTG AGCTCCAGTG 180 

ATAACTCATT TATACAAG6G AGATACCCAG AAAAAAAG7G AGCAAATGTT AAAAAGGTGG 240 

CTTGAGTTCA GCCTTAAATA CCATCTTGAA ATGACACAGA GAAAGAANGA TGTTGGGTG6 300 

GAGTGGATAG AGACCCTAAC G 321 
(2) INFORMATION FOR SEQ ID NO: 74: 

(1) SCQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:74; 

GCACTGA6AG GAACTTCAGA GAGAGAGAGA 6AGTTCCACC CTGTACTTGG GGAGA6AAAC 60 

AGAA6GTGAG AAAGTCTllG GTTCTGAAGC AGCTTCTAAG ATCTTTTCAT TTGCTTCATT 120 

TCAAAGTTCC CATGCTGCCA AAGTGCCATC CTTTGGGGIA CTGTTTTCTG AGCTCCAGTG 180 

ATAACTCATT TATACAAGGG AGATACCCAG AAAAAAAGTG AGCAAATCTT AAAAAGGTGG 240 

CTTGAGTTCA GYCTTAAATA CCATCTTGAA ATGAMACAGA GAAAGAAG6A TGTTGGGTGG 300 

GAGTGGATAG AGACCCTAAC G 321 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 317 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:75: 



GCACTGAGAG 6AACTTCCAC ATGCACTGAG AAATGCATGT TCACAAGGAC T6AAGTCTGG 
AACTCAGTTT CTCAGTTCCA ATCCT6ATTC AGGTGTHAC CAGCTACACA ACCTTAA6CA 
AGTCAGAIAA CCTTAGCTTC CTCATATGCA AAATGA6AAT GAAAAGTACT CATC6CTGAA 
TTGTTTTGAG GATTAGAAAA ACATCTG6CA TGCAGTAGAA ATTCAATTAG TATTCATTTT 
CATTCTTCTA AATTAAACAA ATAGGATTTT TAGTGGTG6A ACHCAGACA CCAGAAATGG 
GAGTGGATAG AGACCCT 



(?) INFORMATION FOR SEQ ID N0:76: 



(1) SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 244 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEO ID N0:76: 



CGTTAGGGTC TCTATCCACT CCCACTACTG ATCAAACTCT ATTTATTTAA TTATrfTTAT 
CATACTTTAA GTTCTGGGAT ACACGTGCAG CATGC6CAGG TTTGTTGCAT AGGTATACAC 
TTGCCATGGT GGTTTGCTGC ACCCATCAGT CCATCATCTA CATTAGGTAT TTCTCCTAAT 
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GCTATCCCTC CCCTAGCCCC TTACACCCCC AACAGGCTCT AGTGTGTGAA GTTCCTCTCA ?A(} 
GTGC ^^44 
(2) INFORMATION FOR SEQ ID N0:77: 

(1) SEOUtNCC CHARACTERISTICS: 

(A) LENGTH: 254 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SFQ ID NO;"//: 

CGTTAGG6TC TCTATCCACT GAAATCTGAA GCACAGGAGG AAGAGAAGCA 6TYCTAGTGA 60 

GATGGCAAGT TCWTTTACCA CACTCTTTAA CATTTYGTTT AGTTTTAACC TTTATTTAT6 1?0 

6ATAATAAAG GTTAATATTA ATAATGATTT ATTTTAAGGC ATTCCCRAAT TTGCATAATI 180 

CTCCTTTTGG AGATACCCTT TTATCTCCAG TGCAAGTCTG GATCAAAGTG ATASAMAGAA 240 



GTTCCTCTCA GTGC 
C2) INFORMATION FOR SEQ ID N0:78: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 355 base pairs 

(B) TYPL: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



254 



(XI) SEQUENCE DESCRIPTION: SEO ID N0:78: 
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TTC6ATACAG GCAAACATGA ACTGCAGGAG GGTGGTGACG ATCATGATGT TGCCGATGGT • 60 

CCGGATGGNC ACGAAGACGC ACTGGANCAC GTGCTTACGT CCTHTGCTC TGTTGATGCC 1?0 

CCTGAGGGGA CGCAGGACCC TTATGACCCT CAGAATCTTC ACAACGGGA6 ATGGCACTGG 180 

ATTGAN7CCC ANTGACACCA GA6ACACCCC AACCACCAGN ATATCANTAT ATT6ATGTAG 240 

TTCCTGTAGA NG6CCCCCTT 6TGGAGGAAA GCTCCATNAG ITGGTCATCT TCAACAG6A1 300 

CTCAACAGTT TCCGATGGCT GTGATGGGCA TAGTCATAMT TAACCNTGTN TCGAA 355 
(?) INFORMATION FOR SEQ ID NO: 79; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTfl: 406 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS; single 

(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

lAAGAGGGTA CCAGCAGAAA GGTTAGTATC ATCAGATAGC ATCTTATACG AGTAATATGC 60 

CTGCTATTTG AAGTGTAATT GAGAAGGAAA ATTTTAGCGT 6CTCACTGAC CTGCCTGTAG 120 

CCCCAGTGAC AGCTAGGATG TGCATTCTCC AGCCATCAAG AGACTGAGTC AAGTTGTTCC 180 

TTAAGICAGA ACAGCAGACT CAGCTCTGAC ATTCTGATTC GAATGACACT GTTCAGGAAT 240 

CGGAATCCTG TCGATTAGAC T6GACAGCTT GTGGCAAGTG AATTTGCCTG TAACAAGCCA 300 

GATTTTTTAA AATTTATATT GTAAATAATG TGTGTGTGTG TGTGTGTATA TATATATATA 360 

TGTACAGTTA TCTAAGTTAA TTTAAAAGTT GTTTGGTACC CTCTTA 406 
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(2) INFORMATION FOR SCQ ID NO:80: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 327 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 
(0) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID N0:80: 

TTTTTTTTTT TTTACTCGGC TCAGTCTAAT CCTTTTTGTA GTCACTCATA GGCCA6ACTT 60 

AGGGCTAGGA TGATGATTAA TAAGAGGGAT GACATAACTA TTAGTGGCAG GTTAGTTGTT 120 

TGTAGGGCTC ATGGTAGGGG TAAAAGGAGG GCAATTTCTA GATCAAATAA TAAGAAGGTA 180 

ATAGCTACTA AGAAGAATTT TATGGA6AAA GGGACGCGGG CGGGGGATAT AGGGTCGAAG 240 

CCGCACTCGT AAGGGGTGGA TTTTTCTATG TAGCCGTTGA GTTGT6GTA6 TCAAAATGTA 300 

ATAATTATTA 6TA6TAAGCC TAGGAGA 327 
(2) INFORMATION FOR SEQ ID N0:81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 318 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

Cxi) SEQUENCE DESCRIPTION: SEQ 10 N0:81: 
TAGTCTATGC GGTT6ATTCG GCAATCCATT ATTTGCTGGA HTTGTCATG TGTTTTGCCA 60 
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AITGCATTCA TAATTTATTA TGCATTTATG CTTGTATCTC CTAA6TCATG GTATATAATC 120 

CATGCTTTTT ATGTTTTGTC TGACATAAAC TCTTATCAGA GCCCTTTGCA CACAGGGATT 180 

CAAIAAAFAT TAACACAGTC TACATTTATT TGGTGAATAT TGCATATCTG CTGTACTGAA 240 

AGCACATTAA GTAACAAAGG CAAGTGAGAA GAATGAAAA6 CACTACTCAC AACAGTTATC 300 

ATGATTGCGC ATAGACTA 313 
(2) INFORMATION FOR SEO ID N0:8?: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 338 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEO ID N0:82: 

rCTTCAACCl CTACTCCCAC TAATAGCTTT TTGATGACTT CTAGCAAGCC TCGCTAACCT 60 

CGCCTTACCC CCCACTATTA ACCTACTGGG AGAACTCTCT GTGCTAGTAA CCACGTTCTC 120 

CTGATCAAAT ATCACTCTCC TACTTACAGG ACTCAACATA CTAGTCACAG CCCTATACTC ISO 

CCTCTACATA TTTACCACAA CACAATGGGG CTCACTCACC CACCACATTA ACAACATAAA 240 

I " ACCCTCATTC ACACGAGAAA ACACCCTCAT 6TTCATACAC CTATCCCCCA TTCTCCTCCT 300 

ATCCCTCAAC CCCGACATCA TTACCGGGTT TTCCTCTT 338 
(2) INFORMATION FOR SEO ID N0:83: 
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(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 111 base pairs 
(R) TYPE: nudcic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: finear 



(x1) SEQUENCE DESCRIPTION: SEQ ID N0;83; 
AGCCATTTAC CACCCATCCA CAAAAAAAAA AAAAAAAAAG AAAAATATCA AGGAATAAAA 
ATAGACTTTG AACAAAAAGG AACATTTGCT GGCCTGAGGA 6GCATCACCC G 
(2) INFORMATION FOR SEQ ID N0:84: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 base pairs 

(B) lYPE: nucleic acid 
CO STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:84: 

TCGGGTGATG CCTCCTCA6G CCAAGAAGAT AAAGCTTCAG ACCCCTAACA CATTTCCAAA 

AAGGAAGAAA G6A6AAAAAA GGGCATCATC CCC6TTCCGA AGG6TCAGGG AG6AGGAAAT 

TGAG6TGGAT TCACGAGTTG CGGACAACTC CTTT6ATCCC AAGCGAGGTG CAGCCGGAGA 

CTGGGGAGAG CGAGCCAATC AGGTTTTGAA GTTCCTCTCA GTGC 

(2) INFORMATION FOR SEQ ID N0:85: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 348 base pairs 
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CB) TYPF: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEO ID N0:85: 

GCACTGAGAG GAACTTCGTT GGAAACGGGT TTTTTTCATG TAAGGCTAGA CAGAA6AATT 60 

CTCAGTAACT TCCTTGTGTT 6TGTGTATTC AACTCACASA GTTGAACGAT CCTTTACACA 120 

GAGCAGACTl GTAACACTCT TWTTGT6GAA TTTGCAAGTG GAGATTTCAG SCGCTTTGAA 180 

GTSAAAGGTA GAAAAGGAAA TATCTTCCTA TAAAAACTAG ACAGAATGAT TCTCAGAAAC 240 

TCCTTTGTGA TGTGTGCGTT CAACTCACAG AGTTTAACCT TTCWTTTCAT AGAAGCAGTT 300 

AGGAAACACT CTGTTT6TAA AGTCT6CAAG TGGATAGAGA CCCTAACG 348 
(2) INFORMATION FOR SEQ ID N0:86: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 293 base pairs 

(B) TYPF: nucleic acid 

(C) STRANDEDNESS: Single 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:86: 

GCACTGAGAG GAACTTCYTT GTGWTGTKTG YATTCAACTC ACAGAGTTGA ASSWTSMHT 60 

ACABAGWKCA GGCTTKCAAA CACTCTTTTT GTMGAATYTG CAAGWGGAKA TTTSRRCCRC 120 

TTTGWGGYCW WYSKTMGAAW MGGRWATATC TTCWYATMRA AMCTA6ACA6 AAKSATTCTC ISO 
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AKAAWSTYYY YTGTGAWGWS TGCRTTCAAC TCACAGAGKT KAACMWTYCT KYTSATRGAG ?40 
CAGTTWKGAA ACTCTMTTTC TTTGGATTCT GCAAGTGGAT AGAGACCCTA ACG ?93 
(?) INFORMATION FOR 5E0 ID N0:87: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: Single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:87: 
CTCCTAGGCT 10 
(?) INf-ORMATION FOR SEQ ID NO:88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ 10 N0:88: 
AGTAGTTGCC 10 
(2) INFORMATION FOR SEQ ID N0:89: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: II base pairs 

(B) TYPE: nucleic acid 

(C) STRANDFDNESS: single 



I 

I 
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(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID N0:89: 
TTCCGTTATG C 11 
(?) INFORMATION FOR SEQ ID N0:90: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID N0:90: 
TGGTAAAGGG 10 
(2) INFORMATION FOR SEQ ID N0:91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID N0:91: 
TCGGTCATAG 10 
(2) INFORMATION FOR SEQ ID N0:92: 
Ci) SEQUENCE CHARACTERISTICS; 
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(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



i 

I (xi) SEQUENCE DESCRIPTION: SEQ ID N0:92: 



TACAACGAGG 10 
(?) INFORMATION FOR SEQ JO N0:93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(x1) SEQUENCE DESCRIPTION; SEQ ID N0:93: 
TGGATTGGIC 10 
(2) INFORMATION FOR SEQ ID NO: 94: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:94: 
CTTTCTACCC 10 
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(2) INFORMATION FOR SEO ID N0:95: 

(1) SLQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID N0:95: 
TTTTGGCTCC 10 
(2) INFORMATION FOR 5E0 ID NO: 96: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEO ID N0:96: 
GGAACCAATC 10 
(2) INFORMATION FOR SEQ ID N0:97: 

()) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID N0:97: 
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TCGATACAGG 10 

•^^ (?) INFORMATION FOR SFO ID N0:98: 

I (i) SEQUENCE CHARACTERISTICS: 

I (A) LENGTH: 10 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEO ID N0:98: 

GGTACTAA6G 10 

(?) INFORMATION FOR SEQ ID N0:99: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pairs 
CB) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(x1) SEQUENCE DESCRIPTION: SEO ID N0:99: 
AGTCTATGCG 10 
(2) INFORMATION FOR SEQ ID NO: 100: 



(i) SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCL DESCRIPTION: 5EQ ID NO:100: 
CTATCCATGG 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:101: 
TCTGTCCACA 

(2) INFORMATION FOR SEQ ID NO: 102: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 102: 
AAGA6G6TAC 

(2) INFORMATION FOR SEQ ID NO; 103: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pairs 
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(B) TYPF: nucleic acid 

(C) STRANDE0NES5: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEO ID NO: 103: 
CTTCAACCTC 10 
(2) INFORMAIION FOR SEQ ID NO: 104: 

(1) SEOUENCF CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID HO-.m: 
GCTCCTCTTG CCTTACCAAC 20 
(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:105: 
GTAAGTCGAG CAGTGTGATG 20 
(2) INFORMATION FOR SEQ ID NO: 106: 
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(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
GTAAGTCGAG CAGTCTGAT6 20 
(?) INFORMATION FOR SEQ ID NO: 107; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID N0:107: 

GACTTAGTGG AAAGAAT6TA 20 

(?) INFORMATION FOR SEQ ID NO: 108; 

(1) SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
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GTAATTCCGC CAACC6TAGT 20 
(2) INFORMATION FOR SCO 10 NO: 109: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) 5TRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
ATGGTT6ATC 6ATAGTGGAA 20 
(2) INFORMAIION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
, (D) TOPOLOGY: linear 



(x1) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 
ACGGGGACCC CTGCATT6AG 20 
(2) INFORMATION FOR SEQ ID NO: 111: 

% (1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: Hi: 
TATTCTAGAC CATTCGCTAC 20 
(?) INFORMATION FOR SEQ ID NO: 112: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 
ACATAACCAC TTTAGCGTTC 
(2) INFORMATION FOR SEQ ID NO: 113: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 



(x1) SEQUENCE DESCRIPTION: SEQ ID NO; 113: 
CGGGTGATGC CTCCTCAGGC 20 
C2) INFORMATION FOR SEQ ID NO: 114: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 
([)) TOPOIOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 
AGCATGTTGA GCCCAGACAC ?0 



(2) INrORMATION FOR SEQ ID NO: 115: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 



GACACCTTGT CCAGCATCT6 20 
(2) INFORMATION FOR SEQ ID NO: 116: 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPF: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 
TACGCTGCAA CACTGTGGAG 20 
(?) INFORMATION FOR SEQ ID N0:117: 
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(i) SEQUENCE CHARACTERISTICS: 
CA) LENGTH; 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: hnear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117; 
CGTTAGGGTC TCTATCCACT 20 
(2) INFORMATION FOR SEQ ID NO: 118: 

(1) SEQUENCE CHARACTERISTICS; 

CA) LENGTH: 20 base pairs 

CB) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SLO ID NO: 118: 
AGACTGACTC ATGTCCCCTA 20 
(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID N0:119: 



TCATCGCTCG GTGACTCAAG 



20 
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(2) INFORMATION FOR SEO ID NO: 120: 

(1) SEQUENCt CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NO; 120: 
CAAGATTCCA TAGGCTGACC 20 
(2) INFORMATION FOR SEQ ID NO: 121: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID N0:121: 
ACGTACIGGT CTTGAAGGTC 20 
(2) INFORMATION FOR SEQ ID NO: 122: 

(1) SEQUENCt CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:122; 
; GACGCHGCC CACTTGACAC ?0 

(?) INFORMATION FOR SEQ ID NO: 123: 

I (1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 
GTATCGACGT AGTGGTCTCC 20 
(2) INFORMATION FOR SEQ ID NO: 124: 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:124: 
TAGT6ACATT AC6ACGCTGG 
C2) INFORMATION FOR SEQ ID NO: 125: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 
CGGGTGATGC CTCCTCAGGC 20 
(2) INFORhiATION FOR SEQ ID NO: 126: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
([)) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ 10 N0:126: 
ATGGCTATTT TC6GGGGCTG ACA 23 
(2) INFORMATION FOR SEQ ID NO: 127: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 
CCGGTATCTC CTCGTGGGTA TT 22 
(2) INFORMATION FOR SEQ ID NO: 128; 
(1) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEO ID NO: 128: 
CTGCCTGAGC CACAAATG 18 
(?) INFORMATION FOR SEQ ID NO: 129: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID N0:129: 
CCGGAGGAGG AAGCTAGAGG AATA 24 
(2) INFORMATION FOR SEQ ID NO: 130: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEO ID NO: 130: 
HTHTTTTT TTAG 14 
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(?) INFORMATION FOR SEQ ID NO: 131: 

(I) SEQUENCF CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 

Ser Ser Gly Gly Arg Ihr Phe Asp Asp Phe His Arg Tyr Leu Leu Val 
15 10 lb 

Gly lie 



(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID N0:132: 

Gin Gly Ala Ala Gin Lys Pro He Asn Leu Ser Lys Xaa lie Glu Val 
15 10 15 

Val Gin Gly His Asp Glu 
20 

(2) INFORMATION FOR SEQ ID NO: 133: 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH; 23 amino acids 
(R) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(x-i) SEQUENCE DESCRIPTION: SFQ ID N0:133: 

Sor Pro G'ly Val Phe Lou G1u His Leu Gin Glu Ala Tyr Arg He Tyr 
15 10 15 

Thr Pro Phe Asp Leu Ser Ala 
20 

(2) INFORMATION FOR SEQ ID NO: 134: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 9 amino acids 
(8) TYPE: amino acid 
(C) STRANDEDNESS: Single 
CD) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION; SEQ ID NO: 134: 

Tyr Lgu Leu Val Gly He Gin Gly Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 135: 

(1) SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 9 amino acids 
(B) TYPE: amino acid 
CO STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(XI) SEQUENCE DESCRIPTION: SEO ID NO: 135: 

Gly Ala Ala Gin Lys Pro lie Asn Leu 
1 5 

(2) INFORMATION FOR SFO ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino adds 

(B) TYPE: ammo acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

Asn Leu Ser Lys Xaa He Glu Val Val 
1 5 

(2) INFORMATION FOR SEQ ID NO: 137: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

Glu Val Val Gin Gly His Asp Glu Ser 
1 5 

(?) INFORMATION FOR SEQ ID NO: 138: 
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(1) SEOUENCL CHARACTERISTICS: 
(A) LENGTH; 9 amino adds 
(R) TYPE: amino acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY; linear 



(xD SEQUENCE DESCRIPTION: SEQ ID N0:138; 

His l.eu Gin Glu Ala Tyr Arg He Tyr 
1 5 

(2) INFORMATION FOR SEO ID NO: 139: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 ammo acids 

(B) TYPE: amino add 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEO ID NO: 139: 

Asn Leu Ala Phe Val Ala Gin Ala Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 ammo adds 

(B) TYPE:, afmno add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 
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(XI) SEQUENCE DESCRIPTION: SEQ ID NO; 140: 

Phe Val Ala Gin Ala Ala Pro Asp Scr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9388 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 



GCTCGCGGCC GCGAGCTCAA TTAACCCTCA CTAAAGGGAG TCGACTCGAT CAGACTGTTA 60 



CTGTGTCTAT GTAGAAAGAA GTAGACATAA GA6ATTCCAT TTTGTTCTGT ACTAAGAAAA 120 



ATTCTTCTGC CTTGAGATGC TGTfAATCTG TAACCCTA6C CCCAACCCTG TGCTCACAGA 180 



GACATGTGCT GTGTTGACTC AAGGTTCAAl GGATTTAGGG CTATGCTHG TTAAAAAAGT 240 



GCTTGAAGAT AATATGCTTG TTAAAAGTCA TCACCATTCT CTAATCTCAA GTACCCAGGG 300 



ACACAATACA CTGCGGAAGG CCGCAGGGAC CTCTGTCTAG 6AAAGCCAG6 TATTGTCCAA 360 



GATTTCTCCC CAT6T6ATAG CCTGA6ATAT 6GCCTCATGG GAAGGGTAAG ACCTGACTGT 420 



CCCCCAGCCC GACATCCCCC AGCCCGACAT CCCCCAGCCC GACACCCGAA AAGGGTCTGT 480 



GCTGAGGAGG ATTAGTAAAA GAGGAAGGCC TCTTTGCAGT TGAGGTAAGA GGAAGGCATC 540 



TGTCTCCTGC TCGTCCCTGG GCAATAGAAT GTCTTGGTGT AAAACCCGAT TGTATGTTCT 600 
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ACTTACTGAG ATAGGAGAAA ACATCCTTAG GGCTGGAGGT 6AGACACGCT GGCGGCAATA 660 

CTGCTCHTA ATGCACC6AG ATGTTTGTAT AAGTGCACAT CAAGGCACAG CACCTTTCCT 720 

TAAACTTATT TATGACACAG AGACCTTTGT TCACGTTTTC CTGCTGACCC TCTCCCCACT 780 

ATTACCCTAT TGGCCTGCCA CATCCCCCTC TCCGA6ATGG TAGAGATAAT GATCAATAAA 840 

TACTGAGGGA ACTCAGAGAC CAGTGTCCCT GTAGGTCCTC CGTGTGCTGA 6CGCCGGTCC 900 

CTTGGGCTCA CTTTTCTTTC TCTATACTTT GTCTCTGTGT CTCTTTCTTT TCTCAGTCTC 960 

TCGTTCCACC TGACGAGAAA TACCCACAGG TGTGGAGGGG CAGGCCACCC CTTCAATAAT 1020 

TTACTAGCCT GTTCGCTGAC AACAAGACTG GTGGTGCAGA AGGTTGGGTC TTGGTGTTCA 1080 

CCGGGTGGCA GGCATGGGCC AGGTGGGAGG GTCTCCAGCG CCTGGTGCAA ATCTCCAAGA 1140 

AAGTGCAGGA AACAGCACCA AGGGTGATTG TAAATHTGA TTTGGCGCGG CAGGTA6CCA 1200 

TTCCAGCGCA AAAATGCGCA GGAAAGCTTT TGCTGTGCTT GTAGGCAGGT AGGCCCCAAG 1260 

CACTTCTTAT TGGCTAATGT GGAGGGAACC TGCACATCCA TTGGCTGAAA TCICCGTCTA 1320 

TTTGAGGCT6 ACTGAGCGCG TTCCTTTCTT CTGTGTTGCC TGGAAACGGA CTGTCTGCCT 1380 

AGTAACATCT GATCACGTTT CCCATTGGCC GCCGTTTCCG GAAGCCCGCC CTCCCATTTC 1440 

CGGAAGCCTG GCGCAAGGTT G6TCTGCAGG TGGCCTCCAG GTGCAAAGTG GGAAGT6TGA 1500 

GTCCTCAGTC TTGGGCTATT CGGCCACGTG CCTGCCGGAC ATGGGACGCT GGA6GGTCAG 1560 

CAGCGTGGAG TCCTGGCCTT TTGCGTCCAC G6GTGGGAAA TTGGCCATTG CCAC6GC6GG 1620 

AACTGGGACT CAGGCTGCCC CCC6GCCGTT TCTCATCCGT CCACCGGACT CGTGGGC6CT 1680 
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CGCACTGGCG CTGATGTAGT HCCTGACCT CTGACCCGTA TTGTCTCCAG ATTAAAGGTA 1740 
AAAACGGGGC TTTTTCAGCC CACTCGGGTA AAACGCCTTT TGATTTCTAG 6CAGGTGTTT 1800 

TGTTGCACGC CT6GGAGGGA GTGACCCGCA GGTTGAGGTT TATTAAAATA CATTCCTGGT 18G0 

TTATGTTATG TTTATAATAA AGCACCCCAA CCTTTACAAA ATCTCACTTT TTGCCAGTTG 1920 

TATTATTTAG TGGACTGICT CTGATAAGGA CA6CCA6TTA AAATGGAATT TTGTTGTTGC 1980 

TAATIAAACC AATTTTTA6T TTTGGTGTTT GTCCTAATAG CAACAACTTC TCAGGCTTTA 2040 

TAAAACCAIA TTTCTTGGGG GAAAHTCTG TGTAAGGCAC AGCGAGTTAG TTTGGAATTG 2100 

TTTTAAAGGA AGIAAGTTCC IGGTHTGAT ATCTTA6TAG TGTAATGCCC AACCTG6TTT 2160 

TTACTAACCC TGTTTTTAGA CTCTCCCTTT CCTTAAATCA CCTAGCCTTG TTTCCACCTG 2220 

AATTGACTCT CCCTTAGCTA A6AGCGCCAG ATGGACTCCA TCTTGGCTCT TTCACTGGCA 2280 

GCCCCTTCCT CAAG6ACTTA ACTTGTGCAA GCTGACTCCC AGCACATCCA AGAATGCAAT 2340 

TAACTGTIAA GATACTGTGG CAAGCTATAT CC6CAGTTCC GAGGAATTCA TCCGATTGAT 2400 

TATGCCCAAA AGCCCCGCGT CTATCACCTT GTAATAATCT TAAAGCCCCT GCACCTGGAA 2460 

CTATTAACTT TCCT6TAACC ATTTATCCTT TTAACTHn TGCTTACTTT ATTTCTGTAA 2520 

AATTCTTTTA ACTA6ACCTC CCCTCCCCTT TCTAAACCAA AGTATAAAAG AA6ATCTA6C 2580 

CCCTTCTTCA GAGCGGAGAG AATTTTGAGC ATTAGCCATC TCTTGGCGGC CAGCTAAATA 2640 

AATGGACTTT TAATHGTCT CAAAGTGTGG CGTTTTCTCT AACTCGCTCA GGTAC6ACAT 2700 

TT6GAGGCCC CAGCGAGAAA CGTCACCGGG AGAAACGTCA CCGGGCGAGA GCCGGGCCCG 2760 
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CTGTGTGCTC CCCCGGAAGG ACAGCCA6CT TGTAGGGGGG AGT6CCACCT GAAAAAAAAA 2820 

TTTCCAGGIC CCCAAAGGGT GACC6TCTTC CGGAGGACAG CGGATC6ACT ACCATGC6GG 2880 

TGCCCACCAA AATTCCACCT CTGAGTCCTC AACTGCTGAC CCCGGGGTCA GGTAGGTCAG 2910 

AHTGACTTT GGTTCTGGCA GAGGGAAGCG ACCCTGATGA GGGTGTCCCT CTTTTGACTC 3000 

TGCCCATTTC TCTAGGAIGC TA6AGGGIA6 A6CCCTGGTT TTCT6TTAGA CGCCTCTGTG 3060 

TCTCTGTCTG GGAGGGAAGT GGCCCTGACA GGGGCCATCC CTTGAGTCAG TCCACATCCC 3120 

AGGATGCTGG GGGACTGAGT CCTGGTTTCT GGCAGACT6G TCTCTCTCTC TCTCTTTTIC 3180 

TATCTCTAAT CTTTCCTTGT TCAG6TTTC7 TG6AGAATCT CTGGGAAAGA AAAAAGAAAA 3240 

ACTGTTATAA ACTCTGTGTG AAT6GTGAAT GAATGGGGGA GGACAAGGGC TTGCGCTTGT 3300 

CCTCCAGTTT GTAGCTCCAC GGCGAAAGCT ACGGAGTTCA AGTGGGCCCT CACCTGCGGT 3360 

TCCGTGGCGA CCTCATAAGG CTTAAGGCAG CATCCGGCAT AGCTCGATCC GAGCCGGGGG 3420 

TTTATACCGG CCTGTCAATG CTAAGAGGA6 CCCAAGTCCC CTAA6GGGGA GCGGCCAGGC 3480 

6GGCATCTGA CTGATCCCAT CACGGGACCC CCTCCCCTTG TTTGTCTAAA AAAAAAAAAA 3540 

GAAGAAACTG TCATAACTGT TTACATGCCC TAGGGTCAAC TGnTGTTTT ATGHTATTG 3600 

T ICTGTTCGG TGICTAHGT CTTGTTTAGT GGTT6TCAAG GTTTTGCATG TCAGGACGTC 3660 

GATATTGCCC AAGACGTCTG GGTAAGAACT TCTGCAAGGT CCTTAGTGCT GATTTTHGT 3720 

CACAGGAGGT TAAATTTCTC ATCAATCATT TAGGCTGGCC ACCACAGTCC TGTCTTTTCT 3780 

GCCAGAAGCA AGTCAGGTGT TGnACGGGA ATGAGTGTAA AAAAACATTC GCCTGATTGG 3840 
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GATTTCTGGC ACCAT6ATGG TTGTATTTAG ATTGTCATAC CCCACATCCA G6TTGATTGG 3900 

ACCTCCTCTA AACTAAACTG GTGGTGGGTT CAAAACA6CC ACCCIGCAGA TTTCCTTGCT 3960 

CACCTCTTIG GTCATTCTGT AACTTTTCCT GTGCCCTTAA ATAGCACACT GTGTAGGGAA 402G 

ACCTACCCTC GTACTGCTTT ACTTCGTTTA GAHCTTACT CTGTTCCTCT GTGGCTACTC 4080 

(CCCATCTTA AAAACGATCC AAGT6GTCCT TTTCCTCCTC CCT6CCCCCT ACCCCACACA 41^0 

TCTCGTTTTC CAGTGCGACA GCAAGTTCAG CGTCTCCAGG ACTTGGCTCT GCTCTCACTC 4200 

CTTGAACCCT TAAAAGAAAA AGCTGGGTTT GAGCTATTTG CCTTTGAG7C ATGGAGACAC 4260 

AAAAGGTATT TAGGGTACAG ATCTA6AAGA AGAGAGAGAA CACCTAGATC CAACTGACCC 4320 

AGGA6ATCTC GGGCTGGCCT CTAGTCCTCC TCCCTCAATC TTAAAGCTAC AGTGATGTGG 4380 

CAAGTGGTAT TTAGCTGTTG TGGTTTTTCT GCTCTTTCTG GTCATGTTGA TTCTGTTCTl 4440 

TCGATACTCC AGCCCCCCAG GGAGTGAGTT TCTCTGTCTG TGCTGGGTTT GATATCTATG 4500 

TTCAAATCTT ATTAAATTGC CTTCAAAAAA AAAAAAAAAA GGGAAACACT TCCTCCCAGC 4560 

CTTGTAAGGG TTGGAGCCCT CTCCAGTATA TGCTGCAGAA TTTTTCTCTC GGTTTCTCAG 4620 

AGGATTATG6 AGTCCGCCTT AAAAAAGGCA AGCTCTGGAC ACTCTGCAAA GTAGAATGGC 4680 

CAAAGTTTGG AGTT6AGTGG CCCCTTGAAG GGTCACTGAA CCTCACAATT GTTCAAGCTG 4740 

TGTGGCGGGT TGTTACTGAA ACTCCCGGCC TCCCTGATCA GTTTCCCTAC ATTGATCAAT 4800 

GGCTGAGTTT GGTCAGGAGC ACCCCTTCCA TGGCTCCACT CATGCACCAT TCATAATTTT 4860 

ACCTCCAAGG TCCTCCTGAG CCAGACCGTG TTTTCGCCTC GACCCTCAGC CGGTTCAGCT 4920 
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CGCCCTGTAC TGCCTCTCTC TGAAGAAGAG GAGAGTCTCC CTCACCCAGT CCCACC6CCT 4980 

TAAAACCAGC CTACTCCCH AGGGTCATCC CATGTCTCCT CGGCTATGTC CCCT6TAGGC 5040 

TCATCACCCA TTGCCTCTTG GTTGCAACCG TGGTGGGAGG AAGTAGCCCC TCTACTACCA 5100 

CTGAGAGAGG CACAAGTCCC TCTGGGTGAT GAGTGCTCCA CCCCCTTCCT GGTTTATGTC 5160 

CCTTCTTTCT ACTTCTGACT TGTATAATTG GAAAACCCAT AATCCTCCCT TCTCTGAAAA 5220 

GCCCCAGGCT TT6ACCTCAC TGATGGAGTC TGTACTCTGG ACACATTGGC CCACCTGGGA 5280 

TGACTGTCAA CAGCTCCTTT TGACCCTTTT CACCTCTGAA GAGAGGGAAA GTATCCAAAG 5340 

AGAGGCCAAA AAGTACAACC TCACATCAAC CAATAGGCCG GAGGAGGAAG CTAGA6GAAT 5400 

AGTGATTAGA GACCCAATTG GGACCTAATT GGGACCCAAA TTTCTCAAGT GGAGG6AGAA 5460 

CTTTTGACGA TTTCCACCGG TATCTCCTCG TGGGTATTCA GGGAGCTGCT CA6AAACCTA 5520 

TAAACTTGTC TAAGGCGACT GAAGTCGTCC AGGGGCATGA TGAGTCACCA GGAGTCHTT 5580 

TAGAGCACC7 CCAGGAGGCT TATCGGATTT ACACCCCTTT TGACCT6GCA GCCCCCGAAA 5640 

ATAGCCAT6C TCTTAAHTG GCATTTGTGG CTCAGGCAGC CCCAGATAGT AAAAGGAAAC 5700 

TCCAAAAACT AGAGGGAHT TGCTGGAATG AATACCAGTC AGCTTTTAGA GATAGCCTAA 5760 

AAGGTTTTTG ACAGTCAAGA G6TTGAAAAA CAAAAACAAG CAGCTCAGGC AGCTGAAAAA 5820 

AGCCACTGAT AAAGCATCCT 6GAGTATCAG AGTTTACTGT TAGATCAGCC TCATTTGACT 5880 

TCCCCTCCCA CATGCTGHT AAATCCAGCT ACACTACTTC CTGACTCAAA CTCCACTATT 5940 

CCTGTTCATG ACTGTCAGGA ACTGTTGGAA AC7ACTGAAA CTGGCCGACC TGATCTTCAA 6000 
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AATGTGCCCC TAGGAAAGGl GGATGCCACC GTGTTCACAG ACAGTAGCAG CTTCCTCGAG 6060 

AAGGGACTAC GAAAG6CCGG TGCAGCTGTT ACCATG6AGA CAGATGTGTT 6TGGGCTCAG 6120 

GCTTTACCAG CAAACACCTC AGCACAAAAG GCTGAATTGA ICGCCCTCAC TCAGGCTCTC 6180 

CGA16GGGTA AGGATATTAA CGTTAACACT GACAGCAGGT AC6CCTTTGC TACTGTGCAT 6240 

GTACGTG6AG CCATCTACCA GGAGCGTGG6 C1ACTCACCT CA6CAGGTGG CTGTAATCCA 6300 

CTGTAAAGGA CATCAAAAGG AAAACACGGC TGTTGCCCGT GGTAACCAGA AAGCTGATTC 6360 

A6CAGCTCAA GATGCAGTGT GACTTTCAGT CACGCCTCTA AACTTGCTGC CCACAGTCTC 6420 

CTTTCCACAG CCAGATCTGC CTGACAATCC CGCATACTCA ACAGAAGAAG AAAACT6GCC 6480 

TCAGAACTCA GAGCCAATAA AAATCAGGAA GGTTGGTGGA TTCTTCCTGA CICTAGAATC 6540 

TTCATACCCC 6AACTCTTGG GAAAACHTA ATCAGTCACC TACAGTCTAC CACCCATTTA 6600 

GGAG6AGCAA AGCTACCTCA GCTCCTCC6G AGCCGTTTTA A6ATCCCCCA TCTTCAAAGC 6660 

CTAACAGATC AAGCAGCTCT CCGGTGCACA ACCTGCGCCC AGGTAAAT6C CAAAAAAG6T 6720 

CCTAAACCCA GCCCAGGCCA CCGTCTCCAA GAAAACTCAC CAGGAGAAAA GTGGGAAATT 6780 

GACTTTACAG AAGTAAAACC ACACCGGGCT GGGTACAAAT ACCTTCTAGT ACTGGTAGAC 6840 

ACCTTCTCTG GATGGACT6A AGCATTTGCT ACCAAAAACG AAACTGTCAA TATGGTAGTT 6900 

AAGTTTTTAC TCAATGAAAT CATCCCTCGA CGTGGGCTGC CTGTTGCCAT AGGGTCTGAT 6960 

AATGGACCGG CCHCGCCTT GTCTATAGTT TAGTCAGTCA GTAAGGCGTT AAACATTCAA 7020 

TGGAAGCTCC AHGIGCCTA TCGACCCCAG AGCTCTGGGC AAGTAGAACG CATGAACTGC 7080 
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ACCCTAAAAA ACACTCTTAC AAAATTAATC HAGAAACCG GTGTAAATTG TGTAAGTCTC 7140 

CTTCCTTTAG CCCTACTIAG AGTAAGGTGC ACCCCTTACT GGGCTGG6TT CTTACCTTTT 7200 

GAAAICATGl AT6GGAGGGC GCTGCCTATC TTGCCTAAGC TAAGAGATGC CCAATTGGCA 7260 

AAAATATCAC AAACTAATTT ATTACAGTAC CTACAGTCTC CCCAACAGGT ACAAGATATC 7320 

ATCCTGCCAC TTGTICGAGG AACCCATCCC AATCCAATTC CTGAACAGAC A6GGCCCTGC 7380 

CATTCATTCC CGCCAGGTGA CCTGTTGTTT GTTAAAAAGT TCCAGAGAGA AGGACTCCCT 7440 

CCTGCTTGGA AGAGACCTCA CACCGTCATC ACGATGCCAA CGGCTCTGAA GGTGGATGGC 7500 

ATTCCTGCGT GGATTCATCA CTCCCGCATC AAAAAGGCCA ACGGAGCCCA ACTA6AAACA 7560 

TGGGTCCCCA GGGCTGGGTC AGGCCCCTIA AAACTGCACC TAAGTTGGGT GAAGCCATTA 7620 

GATTAATICT TTTTCTTAAT TTTGTAAAAC AATGCATAGC TTCTGTCAAA CTTATGTATC 7680 

TTAAGACTCA ATATAACCCC CTTGTTATAA CTGAGGAATC AAT6ATTT6A TTCCCCAAAA 7740 

ACACAAGTGG GGAAT6TAGT GTCCAACCTG GTTTTTACTA ACCCTGTTTT TAGACTCTCC 7800 

CTTTCCTTTA ATCACTCAGC CTTGTTTCCA CCTGAATTGA CTCTCCCTTA 6CTAAGAGCG 7860 

CCAGATGGAC TCCATCTTGG CTCTTTCACT G6CAGCCGCT TCCTCAAGGA CTTAACTTGT 7920 

GCAAGCTGAC TCCCAGCACA TCCAAGAATG CAATTAACTG ATAAGATACT GTGGCAAGC7 7980 

A1ATCCGCA6 HCCCAGGAA HCGTCCAAT T6ATTACACC CAAAAGCCCC GCGTCTATCA 8040 

CCTTGTAATA ATCTTAAAGC CCCTGCACCT GGAACTATTA AC6TTCCTGT AACCATTTAT 8100 

CCTTTTAACT TTTTTGCCTA CTTTATTTCT GTAAAATTGT TTTAACTAGA CCCCCCCTCT 8160 
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CCTTTCTAAA CCAAAGTATA AAAGCAAATC TAGCCCCTTC TTCAGGCCGA GAGAATTTCG 8220 
AGCGHAGCC GTCTCnGGC CACCAGCTAA ATAAACGGAT TCTTCATGTG TCTCAAAGTG 8280 
TGGCGTTnC TCTAACTCGC TCAG6TACGA CCGT6GTAGT ATTTTCCCCA ACGTCTTATT 8340 

I TTTA6GGCAC GTATGTAGAG TAACTHTAT GAAAGAAACC AGTTAAGGAG GTTTTGGGAT 8400 

TTCCTTTATC AACTGTAATA CTGGTTTT6A TTATTTATH ATTTATTTAT TTTTTTTGAG 8460 
AAGGAGTTTC ACTCTTGTTG CCCAGGCTGG AGTGCAATGG TGCGATCTTG GCTCACTGCA 8520 

ACTTCCGCCT CCCAGGHCA AGCGATTCTC CTGCCTCAGC CTCGAGAGTA GCTGGGATTA 8580 

TAGGCATGCG CCACCACACC CAGCTAATTT TGTATTTTTA GTAAAGATGG G6TTTCTTCA 8640 

TGTTGGTCAA GCTGGTCTGG AACTCCCCGC CTCGGGTGAT CTGCCCGCCT CGGCCTCCGA 8700 

AAGTGCTGGG ATTACAGGTG TGATCCACCA CACCCAGCCG ATTTATATGT ATATAAATCA 8760 

CATTCCTCTA ACCAAAATGT AGTGTTTCCT TCCATCTTGA ATATAGGCTG TAGACCCCGT 8820 

GGGTATG66A CATTGTTAAC AGTGAGACCA CAGCAGTTTT TATGTCATCT GACAGCATCT 8880 

CCAAATAGCC TTCATGGTTG TCACTGCHC CCAAGACAAT TCCAAATAAC ACHCCCAGT 8940 

GATGACTTGC TACTTGCTAT TGTTACTTAA TGTGTTAAGG TGGCTGTTAC AGACACTATl 9000 

AGTATGTCAG GAATTACACC AAAATTTAGT GGCTCAAACA ATCATTTTAT TATGTATGTG 9060 

i GATTCTCATG GTCAGGTCAG GATTTCAGAC AGGGCACAAG GGTAGCCCAC TTGTCTCTGT 9120 

CTATGATGTC TGGCCTCAGC ACAGGAGACT CAACAGCTGG GGTCTGGGAC CAHIGGAGG 9180 

CTTGTTCCCT CACATCTGAT ACCTGGCTTG GGATGnCGA AGAGGGGGTG AGCTGAGACT 9240 
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GAGT6CCTAT AT6TAGTGTT TCCATATGGC CTTGACTTCC TTACAGCCTG GCAGCCTCAG 9300 



GGTAGTCAGA ATTCTTAGGA GGCACAGGGC TCCAGGGCAG ATGCTGAGGG GTCTTTTA7G 9360 



AGGTAGCACA GCAAATCCAC CCAGGATC 



9388 



(2) INFORMATION FOR SEQ ID NO; 142: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 419 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 



TGTAA6TCGA GCAGTGTGAT GGAAGGAATG GTCTTTGGAG AGAGCATATC CATCTCCTCC 60 



TCACTGCCTC CTAATGTCAT GAG6TACACT GAGCAGAATT AAACAGGGTA GTCTTAACCA 120 



CACTATTTTT AGCTACCTTG TCAAGCTAAT GGTTAAAGAA CACTTTTGGT TTACACTTGT 180 



TGGGTCATAG AAGTTGCTTT CCGCCATCAC 6CAATAAGTT TGTGT6TAAT CAGAAGGA6T 240 



TACCTTATGG TTTCAGTGTC ATTCTTTAGT TAACTTGGGA GCTGTGTAAT TTAGGCTTTG 300 



CGTATTATTT CACTTCTGTT CTCCACTTAT 6AAGTGATT6 TGTGTTC6CG TGTGTGTGC6 360 



TGCGCATGTG CTICCGGCAG TTAACATAAG CAAATACCCA ACATCACACT GCTCGACTT 419 



(2) INFORMATION FOR SEQ ID NO: 143: 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 402 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:143: 

TGIAAGTCGA GCAGTGTGAT GTCCACT6CA 6TGTGTTGCT GG6AACAGTT AATGAGCAAA 60 

TTGTATACAA TGGCTAGTAC ATTGACC66G ATTTGTTGAA 6CTGGTGAGT GTTATGACTT 120 

A6CCTGTTAG ACTAGTCTAT GCACATGGCT CTGGTCAACT ACCGCTCTCT CATTTCTCCA 180 

GATAAATCCC CCATGCTTTA TATTCTCTTC CAAACATACT ATCCTCATCA CCACATAGTT 240 

CCTTTGTTAA TGCTTTGTTC TAGACTTTCC CTTTTCTGTT TTCTTATTCA AACCTATATC 300 

TCTTT6CATA 6ATTGTAAAT TCAAATGCCC TCAGG6TGCA GGCA6TTCAT GTAAGGGAGG 360 

GA66CTAGCC AGTGAGATCT GCATCACACT GCTCGACHA CA 402 
(2) INFORMATION FOR SEO ID NO: 144: 

(T) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 

TCGGGTGATG CCTCCTCAGG CCAAGAAGAT AAAGCTTCAG ACCCCTAACA CATTTCCAAA 60 

AAGGAAGAAA GGAGAAAAAA GGGCATCATC CCC6TTCCGA AGGGTCAGGG AGGAGGAAAT 120 

TGAGGTGGAT TCACGAGTTG CGGACAACTC CTTTGATGCC AAGCGAGGTG CAGCCGGAGA 180 
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CTGGGGAGA6 CGAGCCAATC AGGTTTTGAA GTTCCTCTCA GTGC 
(2) INFORMATION FOR SEO ID NO: 145: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 base pairs 

(B) TYPE: nucleic acid 

(C) 5TRANDEDNESS: single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEO ID NO: 145: 
AGCCATTTAC CACCCATCCA CAAAAAAAAA AAAAAAAAAG AAAAATATCA AG6AATAAAA 
ATAGACTTTG AACAAAAAGG AACATTTGCT GGCCTGAGGA GGCATCACCC G 
(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 585 base pairs 

(B) TYPE: niidetc acid 

(C) STRANDEONESS: single 
{[)) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 
TAGCATGTTG AGCCCAGACA CTTGTAGAGA GAGGA6GACA GTTAGAAGAA GAA6AAAAGT 
TTTTAAATGC TGAAAGTTAC TATAAGAAAG CTTTGGCTTT GGATGAGACT TTTAAAGATG 
CAGAGGATGC TTTGCA6AAA CTTCATAAAT ATATGCAGGT GATTCCTTAT TTCCTCCTAG 
AAATTTAGTG ATAHTGAAA TAATGCCCAA ACTTAATTTT CTCCTGAGGA AAACTATTCT 



60 
120 
180 
240 
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ACATTACTTA AGTAAGGCAI TATGAAAAGT TTCTTTITAG GTATAGTTTT TCCTAATTQG 300 

GTTT6ACATT GCTTCATAGT GCCTCTGTTT TTGTCCAIAA TC6AAAGTAA AGATAGCTGT 360 

6AGAAAACTA TTACCTAAAT TTGGTATGTT 6TTTTGAGAA ATGTCCnAT AGGGAGCTCA 420 

CCTGGTGGTT TTTAAATTAT TGTTGCTACT ATAATTGAGC TAATTATAAA AACCTTTTTG 480 

ACACATATTT TAAATTGTCT TTTCCTGTAA TACTGATGAT GATGTTTTCT CATGCATTTT 540 

CTTCT6AATT GGGACCATTG CTGCTGTGTC TGGGC7CACA TGCTA 685 
(2) INFORMATION FOR SEQ ID MO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5/9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(x1) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

TAGCATGTTG AGCCCAGACA CTGGGCAGCG GGGGTGGCCA CGGCAGCTCC TGCCGA6CCC 60 

AAGCGTGTTT GTCTGTGAAG GACCCTGACG TCACCTGCCA GGCIAGGGAG GGGTCAATGT 120 

GGAGT6AATG TTCACCGACT HCGCAGGAG TGTGCAGAAG CCAGGTGCAA CTTGGHTGC 180 

TTGTGTTCAT CACCCCTCAA GATATGCACA CTGCTHCCA AATAAAGCAT CAACT6TCAT 240 

CTCCAGATGG GGAAGACTTT TTCTCCAACC AGCAGGCAGG TCCCCATCCA CTCAGACACC 300 

AGCACGTCCA CCTTCTCGGG CAGCACCACG TCCTCCACCT TCTGCTGGTA CACGGTGATG 360 
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ATGTCAGCAA AGCCGTTCTG CANGACCAGC TGCCCCGTGT GCTGTGCCAT CTCACTGGCC 420 

TCCACCGCG1 ACACCGCTCT AGGCC6CGCA TANTGTGCAC AGAANAAAT6 ATGATCCAGT 480 

(XCACAGCCC ACGTCCAAGA MGACTTTATC CGTCAGGGAT TCTTTATTCT GCAGGATGAC 540 

CTGTGGTATT AATTGTTCGT GTCTGG6CTC AACATGCTA 579 
(2) INFORMATION FOR SEQ ID NO: 148; 

(1) SEQUFNCC CHARACTERISTICS: 

(A) LENGTH: 249 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEONESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEO ID NO: 148: 
TGACACCTTG TCCAGCATCT GCAAGCCAGG AAGAGAGTCC TCACCAAGAT CCCCACCCCG 60 
TTGGCACCAG GATCTTGGAC TTCCAATCTC CAGAACTGTG AGAAATAAGT ATTTGTCGCT 120 
AAATAAATCT TTGTGGTTTC AGATATTTAG CTATAGCAGA TCAGGCTGAC TAAGAGAAAC 180 
CCCATAAGAG TTACATACTC ATTAATCTCC GTCTCTATCC CCAGGTCTCA GATGCTG6AC 240 
AAGGTGTCA 

(2) INFORMATION FOR SEO ID NO: 149: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 255 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEONESS: single 
(0) TOPOLOGY: linear 



249 
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(xi) SEQUtNCC DESCRIPTION: SEQ ID NO: 149: 

TGACACCTTG TCCAGCATCT GCTATTTTGT GACTTTTTAA TAATAGCCAT TCTGACTGGT 60 

GTGAGATGGT AACTCATTGT GGGTTTGGTC TGCATTTCTC TAATGATCAG TGATATTAAG 1?0 

CTTTTTTTAA ATAIGCTTGT TGACCACATG TATATCATCT TTTGAGAAGT GTCTGTTCAl 180 

ATCCTTTGCC GACTTTTTAA TTTTTTTATC TT6TAAA1TT GTTTAATTTC CTTACAGATG 240 

CTGGACAAGG TGTCA 255 
(2) INFORMATION TOR SEQ ID NO: 150: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 318 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

TTACGCTGCA ACACTGTGGA GGCCAAGCTG GGATCACTTC TTCATTCTAA CTGGAGAGGA 60 

6GGAAGTTCA AGTCCAGCA6 AG6GTGGGTG G6TAGACAGT GGCACTCAGA AATGTCAGCT 120 

GGACCCCTGT CCCCGCATAG GCAGGACAGC AAGGCTGTGG CTCTCCAGGG CCAGCTGAAG 180 

AACAGGACAC TGTCTCCGCT GCCACAAAGC GTCAGAGACT CCCATCTHG AA6CACGGCC 240 

TTCTTG6TCT TCCTGCACTT CCCTGTTCTG TTAGAGACCT GGTTATAGAC AAGGCTTCTC 300 

CACAGTGTTG CAGCGTAA 318 
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(2) INFORMATION FOR SEQ ID NO: 151: 

(1) SLQUENCE CHARACTERISTICS: 

(A) LENGTHS 323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 

TNACGCNGCN ACNNTGTAGA GANGGNAAGG CNTTCCCCAC ATTNCCCCTT CATNANAGAA 60 

TTATTCNACC AAGNNTGACC NATGCCNTTT ATGACTTACA TGCNNACTNC NTAATCTGTN 120 

TCNNGCCTTA AAAGCNNNTC CACTACATGC NTCANCACTG TNTGTGTNAC NTCATNAACT 180 

GTCNGNAATA GGGGCNCATA ACTACAGAAA TGCANTTCAT ACTGCTTCCA NTGCCATCN6 240 

CGTGTGGCCT TNCCTACTCT TCTTNTATTC CAAGTAGCAT CTCTGGANTG CTTCCCCACT 300 

CTCCACATTG TT6CAGCNAT AAT 323 
(2) INFORMATION FOR SEQ ID NO: 152: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 base pairs 

(B) TYPE: nucleic add 
CO STRANDEONESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:152: 
TCAAGATTCC ATAGGCTGAC CAGTCCAAGG A6AGTTGAAA TCATGAAGGA GAGTCTATCT 60 
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GGAGAGAGCT GTAGTTTTGA GG6TTGCAAA GACHAGGAT GGAGTTGGTG GGTGTGGTTA 120 

GTCTCIAAGG TTGATTTTGT TCATAAATTT CATGCCCTGA ATGCCTTGCT TGCCTCACCC 180 

TGGTCCAAGC CTTAGTGAAC ACCTAAAAGT CTCTGTCTTC TTGCTCTCCA AACTTCTCCT 240 

GAGGATTTCC TCAGATTGTC TACATTCAGA TCGAAGCCAG TIGGCAAACA AGATGCAGTC 300 
CAGAGGGTCA G 

oil 

(2) INFORMATION FOR SFQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY; linear 

(xi) SEQUENCE DESCRIPTION: SEO ID NO: 153: 

CAAGATTCCA TAGGCTGACC AG6AGGCTAI TCAAGATCTC TGGCAGTTGA GGAAGTCTCT 60 

TTAAGAAAAT AGTT lAAACA ATTTGTTAAA ATTTTTCTGT CTTACTTCAT TTCTGTAGCA j20 

GTTGATATCT GGCTGTCCTT TTTATAATGC A6AGTGGGAA CTTTCCCTAC CATGTTTGAT 180 

AAATGTT6TC CAGGCTCCAT TGCCAATAAT GTGTTGTCCA AAATGCCTGT TTAGTTTTTA 240 

AAGACGGAAC TCCACCCni GCHGGTCTT AAGTATGTAT GGAATGTTAT GATAGGACAT 300 

AGTAGTAGCG GTGGTCAGCC TATGGAATCT TG 332 
(2) INFORMATION FOR SEO ID NO: 154: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 
(0) TOPOLOGY: linear 

i 

(xi) SEQUCNCE DESCRIPTION: SEQ ID NO; 154: 

< TCAAGATTCC ATAGGCTGAC CTGGACAGAG ATCTCCTGGG TCTGGCCCAG 6ACAGCAGGC 60 

TCAACCTCAG TGGAGAAGGT TTCCATGACC CTCAGATTCC CCCAAACCTT GGATTGGGTG 1?0 
ACATTGCATC TCCTCAGAGA GGGAGGAGAT GTANGTCTGG GCTTCCACA6 GGACCTGGTA 180 

I TTfTAGGATC AGGGTACCGC TGGCCTGAGG CTTGGATCAT TCANAGCCTG GGG6TGGAAT 240 

GGCTGGCAGC CTGTGGCCCC ATTGAAATAG GCTCTGGGGC ACTCCCTCTG TTCCTANTTG 300 
AACTTG6GTA A6GAACAGGA AT6TGGTCAN CCTATGGAAT CTTGA 345 
(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 295 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

I ' (xi) SEQUENCE DESCRIPTION: SEO ID NO: 155: 

GACGCTTGGC CACTTGACAC ATTAAACAGT HTGCATAAT CACTANCATG TATTTCTAGT 60 
TTGCTGTCTG CTGTGATGCC CTGCCCTGAT TCTCTGGCGT TAATGATGGC AAGCATAATC 120 
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AAACGCTGTT CTGT7AATTC CAAGTTATAA CTGGCATTGA TTAAAGCATT ATCTTTCACA 180 

ACTAAACTGT TCTTCATAWA ACAGCCCATA TIATTATCAA ATTAAGAGAC AATGTATTCC ?40 

AATATCCTTT ANGGCCAATA TATTTNATGT CCCTIAATTA AGA6CTACTG TCCGT 295 

(2) INFORMATION FOR SEQ ID NO: 156; 

(i) SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 406 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:156: 

GACGCTTGGC CACTTGACAC TGCAGTGGGA AAACCAGCAT GACCCGCTGC CCCCAA6GAA 60 

CCTCGAAGCC CAGGCAGAGG ACCAGCCAIC CCAGCCTGCA GGTAAAGTGT 6TCACCTGTC 120 

AGGTGGGCTT GGGGTGAGTG GGTGG6GGAA GTGTGTGTGC AAAGGGGGTG TNAATGTNTA 180 

TGCGTGTGAG CATGAGTGAT GGCTAGTGTG ACTGCATGTC A6G6AGTGTG AACAAGCGTG 240 

CGGGGGTGTG TGTGCAAGTG CGTATGCATA TGAGAATATG TGTCTGTGGA TGAGTGCATT 300 

TGAAAGTCTG TGTGTGTGC6 TGTGGTCATG ANGGTAANTT ANTGACTGCG CAGGATGTGT 360 

GAGTGTGCAT GGAACACTCA NTGTGTGT6T CAAGTGGCCN ANCGTC 406 
(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 208 base pairs 

(B) TYPE; nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEO 10 NO: 157: 

TGACGCTTGG CCACTTGACA CACTAAAGGG TGTTACTCAT CACTTTCTTC TCTCCTCGGT 60 

GGCATGTGAG TGCATCTATT CACTTGGCAC TCATTT6TTT GGCAGTGACT GTAANCCANA J20 

TCIGATGCAT ACACCAGCTT GTAAATTGAA TAAATGTCTC TAATACTATG TGCTCACAAT 180 



ANGGfANGGG TGAGGAGAAG GGGAGAGA 
(?) INFORMATION FOR SEQ ID NO: 158: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 547 base pairs 

(B) TYPE: niicleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



208 



(xi) SEQUENCE DESCRIPTION: SEO ID NO: 158: 

CTTCAACCTC CTICAACCTC CTTCAACCTC CTGGATTCAA ACAA7CATCC CACCTCAGAC 60 

TCCTTAGTAG CTGAGACTAC AGACTCACGC CACTACATCT G6CTAAATTT TTGTAGAGAT 120 

AGGGTTTCAT CATGTTGCCC TGGCTGGTCT CAAACTCCTG ACCTCAAGCA ATGTGCCCAC ISO 

CTCAGCCTCC CAAAGT6CTG GGATTACAG6 CATAAGCCAC CATGCCCAGT CCATNTTTAA 240 

TCTTTCCTAC CACATTCTTA CCACACTTTC TTTTATGTTT AGATACATAA ATGCTTACCA 300 

TTATGATACA ATTGCCCACA GTATTAA6AC AGTAACATGC TGCACAGGTT TGTAGCCTAG 360 
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GAACAGTAGG CAATACCACA TAGCTTAGGT GTGTGGTAGA CTATACCATC TAG6TTT6TG 420 

TAAGTTACAC TTTATGCTGT TTACACAATG ACAAAACCAT CTAATGATGC ATTTCTCAGA 480 

ATGTATCCTT GTCAGTAAGC TATGATGTAC AGGGAACACT GCCCAAGGAC ACA6ATATTG 540 

TACCTGT 547 
(?) INFORMATION FOR 5EQ ID NO: 159: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 203 base pairs 

(B) TYPE: nucleic acid 

(C) SIRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:159: 

GCTCCTCTTG CCTTACCAAC TCACCCAGTA TGTCAGCAAT TTTATCR6CT TTACCTACGA 60 

AACAGCCTGT ATCCAAACAC TTAACACACT CACCTGAAAA GTTCAGGCAA CAATCGCCTT 120 

CTCATGGGTC TCTCT6CTCC AGTTCTGAAC CTTTCTCTTT TCCTAGAACA TGCATTTARG 180 

TCGATA6AAG TTCCTCTCAG TGC 203 
(2) INFORMATION FOR SEQ ID NO: 160: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 402 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEOUENCL DESCRIPTION: SEO ID NO: 160: 
TGTAAGTCGA GCAGTGTGAT GGGT6GAACA GGGTTGTAAG CAGTAATTGC AAACTGTATT 60 
TAAACAATAA TAATAATATT TAGCATTTAT AGAGCACTTT ATATCTTCAA AGTACTTGCA 120 
AACATTAYCT AATTAAATAC CCTCTCTGAT TATAATCfGG ATACAAAIGC ACTTAAACTC 180 
AGGACAGGGT CATGA6ARAA GTATGCATTT GAAAGTTGGT GCTAGCTATG CTTTAAAAAC 240 
CTATACAATG ATGGGRAAGT TAGAGTTCAG ATTCTGTTGG ACTGTTTTTG TGCATTTCAG 300 
TTCAGCCTGA TGGCAGAATT AGATCATATC TGCACTCGAT GACTYTGCTT GATAACTTAT 360 
CACTGAAATC T6A6TGTTGA TCATCACACT GCTCGACTTA CA 402 
(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEO ID NO: 161: 

AGCATGTTGA GCCCA6ACAC TGACCAGGAG AAAAACCAAC CAATAGAAAC ACGCCCAGAC 60 

ACTGACCAGG AGAAAAACCA ACCAATAAAA ACAGGCCCGG ACATAAGACA AATAATAAAA 120 

TTAGCGGACA AGGACATGAA AACAGCTATT GTAAGAGCGG ATATAGTG6T GTGTGTCTGG 180 

GCTCAACATG CTA I93 
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(2) INFORMATION FOR SEQ ID NO: 16?: 

(i) StOUENCE CHARACTERISTICS; 

(A) LENGTH: 147 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 

TGTTGAGCCC AGACACT6AC CAGGAGAAAA ACCAACCAAT AAAAACAGGC CCGGACATAA 60 

GACAAATAAT AAAATTAGCG GACAAGGACA TGAAAACAGC TATTGTAAGA GCGGATATAG 120 

TGGTGTGTGF CTGGGCTCAA CATGCTA 147 

(2) INFORMATION FOR SEQ ID NO: 163: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 294 base pairs 
CB) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 

TAGCATGTTG AGCCCAGACA CAAATCTTTC CTTAAGCAAT AAATCATTTC TGCATATGTT 60 

TTTAAAACCA CAGCTAAGCC ATGATTATTC AAAAGGACTA TTGTATTGGG TATHTGATT 120 

TGGGTTCTTA TCTCCCTCAC ATTATCTTCA TTTCTATCAT TGACCTCTTA TCCCAGAGAC 180 

TCTCAAACTT TTATGTTATA CAAATCACAT TCTGTCTCAA AAAATATCTC ACCCACTTCT 240 
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CTTCTGTTTC TGCGTGTGTA TGTGTGTCTG TGTGTGTCTG GGCTCAACAT GCTA 294 
(2) INFORMATION FOR SEQ ID NO; 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:164: 

CGGGATTGGC TTTGAGCTGC A6ATGCTGCC TGTGACCGCA CCCGGCGTGG AACAGAAAGC 60 

CACCTGGCTG CAAGTGCGCC AGAGCCGCCC TGACTACGT6 CTGCTGTGGG GCTGGGGCGT 120 

GATGAACTCC ACCGCCCTGA AGGAAGCCCA GGCCACCGGA TACCCCCGCG ACAAGATGTA 180 

CGGCGTGTGG TGGGCCGGTG CG6AGCCCGA TGTGCGTGAC GTGG6CGAA6 GCGCCAAGGG 240 

CTACAACGCG CTGGCTCTGA ACGGCTACGG CACGCAGTCC AAGGTGATCC ANGACATCCT 300 

GAAACACGTG CACGACAAGG GCCAGGGCAC GGGGCCCAAA GACGAAGTGG GCTCGGTGCl 360 

GTACACCCGC GGCGTGATCA TCCAGATGCT GGACAAGGTG TCAATCACTA AT 412 
(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 361 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(x1) SEQUENCt DESCRIPTION; SEQ ID N0:165: 

TTGACACCTT GTCCAGCATC TGCATCTGAT GAGAGCCiCA GATGGCTACC ACTAATGGCA 60 

GAAGGCAAAG GAGAACAGGC ATTGTATGGC AAGAAAGGAA GAAAGAGAGA GGGGAGAAA6 120 

GTGCTAGGTF CTTTTCAACA ACCAGTTCTT 6AT6GAACTG AGA6TAAGAG CTCAA6GCCA 180 

GGTGTGGTGA CICCAACCAG TAATCCCAAC ATTTTAGGAG GCTGAG6CAG GCAGATGTCT 2^10 

TGACCCCATG AGTTTGTGAC CAGCCTGAAC AACATCATGA GACTCCATCT nkZmm 300 

AATCAGGCAT TGTGGTATGC CCTGTAGTCC CAGATGCTGG ACAAGGTGTC 360 

A 36] 
(2) INFORMATION FOR SEQ ID NO: 166: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 427 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEO ID NO: 166: 

iWGACTGACT CATGTCCCCT ACACCCAACT ATCTTCTCCA GGTGGCCAGG CATGATAGAA 60 

TCT6ATCCTG ACTTA6GGGA ATATTTTCTT TTTACTTCCC ATCTTGATTC CCTGCCGGTG 120 

AGTTTCCTGG TTCAGGGTAA GAAAGGAGCT CAGGCCAAAC TAAIGAACAA ATCCATCCTC 180 

ACAGACGTAC AGAATAAGAG AACWTGGACW TAGCCAGCAG AACMCAAKTG AAAMCAGAAC 240 

MCTTAMCTAG GATRACAAMC MCRRARATAR KTGCYCMCMC WTATAATAGA AACCAAACH 300 
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GTATCTAATT AAATATHAT CCACYGTCAG GGCATTAGTG GTTTTGATAA ATACGCTTTG 360 

GCTAGGATIC CTGAGGTTAG AATGGAARAA CAATTGCAMC GAGGGTAGGG GACATGAGTC 420 

AKTCTAA 4?7 

{?) INFORMATION FOR SEQ ID NO: 167: 

(i) SFQUENCE CHARACTERISTICS: 
(A) LENGTH: 500 base pairs 
CB) TYPE; nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 

AACGTCGCAT GCTCCCGGCC GCCATGGCCG CGG6ATAGAC TGACTCATGT CCCCTAAGAT 60 

AGAGGAGACA CCTGCTAGGT GTAAGGAGAA GATGGHAGG TCTACGGAGG CTCCA6GGTG 120 

GGAGTAGTTC CCTGCTAAGG GAGGGTAGAC TGTTCAACCT GTTCCTGCTC CGGCCTCCAC 180 

TATAGCAGAT GCGAGCAGGA GTAGGAGAGA GGGAGGTAAG AGTCA6AAGC TTATGTTGTT 240 

TATGCGGG6A AACGCCRTAT CGG6GGCAGC CRAGTTATTA GGGGACANTR TAGWYARTCW 300 

AGNTAGCATC CAAAGCGNGG GAGTTNTCCC ATATGGTTGG ACCTGCAGGC GGCCGCATTA 360 

GTGATTAGCA TGTGAGCCCC AGACACGCAT AGCAACAA6G ACCTAAACTC AGATCCTGT6 420 

CTGATTACTT AACATGAATT ATTGTATTTA TTTAACAACT TTGAGTTATG AGGCATATTA 480 

.9725426A3_IB> 
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TTAGGTCCAT ATTACCTGGA 



500 



(2) INFORMATION FOR SEQ ID NO: 168: 



I 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 358 base pairs 

(B) rvPF: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xj) SEQUENCE DESCRIPTION: SEQ ID NO: 168: 



i 



TTCATCGCTC 6GTGACTCAA GCCTGTAATC CCA6AACTTT GGGAG6CCGA GGGGAGCAGA 60 
TCACCTGAGG TTGGGAGTTT GA6ACCAGCC TGGCCAACAT GGTGACAACC CGTCTCTGCT 120 



AAAAATACAA AAATTAGCCA AGCATGGTGG CATGCACHG TAATCCCAGC TACTCGGGAG 180 

GCTGAG6CAG GAGAATCACT TGAGGCCAGG AGGCAGAGGT TGCAGT6AGG CAGAGGTTGA 240 

GATCATGCCA CT6CACTCCA GCCTGGGCAA CAGAGTAAGA CTCCATCTCA AAAAAAAAAA 300 

AAAAAAAGAA TGATCAGAGC CACAAATACA GAAAACCTTG AGTCACCGAG CGATGAAA 358 
(2) INFORMATION FOR SEQ ID NO: 169: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1265 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169: 
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TTCTGTCCAC ACCAATCTTA 
AACATGTATT TTATGGACCA 

'7; 

^ ' TGAATITCAG CACACTGAGT 

1 

TATTTATTTA AGATTGATTC 
^ AGGTAGAACA AATACTTTCT 

ACTCTAAAAA AACAGAGAAC 
ATATTCACTT AAAAAAATGA 
I GGAAAAACTT AGTCACCCTG 

ARGTTTGGGG GTGGACATTG 
GTATGGGTCA GAGAAAATGA 
CCACATGTCT CAGCTATATT 
TATIAATTTC CAAA6GGTTT 
ATGGGTTGAT ATTTTTCTTT 
CCT6TTTTAT TTTAAAACTT 
ATTTTGTTGG TTHTTACTT 

I 

GCAGTTTCTT TAACCAATGT 
TGCATTTCAA CTATGTCAAT 
TTTATTTACA ATATGTTTAA 
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GAGCTCTGAA A6AATTTGTC TTTAAATATC TTTTAATAGT 
AATTGACATT TTCGACTATT TTTTCCCAAA AAAAGTCAGG 
TGGGAATTTC TTATCCCA6A AGWCGGCACG AGCAATTTCA 
CATACTCCGT TTTCAAGGAG AATCCCTGCA GTCTCCTTAA 
ATniTTTTT CACCATTGTG GGATTG6ACT TTAAGAGGTG 
AAATATGTCT CAGTTGTATT AAGCACGGAC CCATATTATC 
TTTCCTGTGC ACCTTTTGGC AACTTCTCTT TTCAATGTAG 
AAAACCCACA AAATAAATAA AACTTGTAGA TGTGGGCAGA 
TATGT6TTTA AATTAAACCC T6TATCACTG AGAAGCTGTI 
ATGCTTAGAA GCTGHCACA TCTTCAAGAG CAGAAGCAAA 
ATTATTTATT TTTTATGCAT AAAGTGAATC ATTTCTTCTG 
TACCCTCTAT TTAAATGCTT TGAAAAACAG TGCATTGACA 
AAAAGAAAAA TATAATTATG AAAGCCAAGA TAATCTGAA6 
TTTATGTTCT GTGGTTGATG TTGTTTGTTT GTTT6TTTCT 
TGTTTTTTGT TTTGTTTTGT TTTGGTTTDG CATACTACAT 
CTGTTTGGCT AATGTAATTA AAGTTGTTAA TTTATATGAG 
GGTTTCTTAA TATTTAnGT GTAGAAGTAC TG6TAATTTT 
AGAGATAACA GTTTGATATG TTTTCATGTG TTTATA6CAG 
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AAGTTATTTA TTTCTATGGC ATTCCAGCGG ATATTTTGGT 6TTTGC6AGG CATGCAGTCA 1140 



ATATTTTGTA CAGTTA6TGG ACAGTATTCA GCAACGCCTG ATAGCTTCTT TGGCCTTATG 1200 



TTAAATAAAA AGACCTGTTT GGGATGTAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1260 



AAAAA 



1265 



(2) INFORMATION FOR 5EQ ID NO: 170: 

(1) SEQUENCE CHARACTERISTICS: 
CA) IFNGTH: 383 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS; single 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: 5EQ ID N0:170: 



T6TAAGTCGA GCAGTGTGAT GACGATATTC TTCTTATTAA TGTGGTAATT GAACAAAT6A 60 



TCTGTGATAC TGATCCTGAG CTAGGAGGCG CTGTTCAGTT AATGGGACTT CTTCGTACTC 120 



TAATTGATCC AGAGAACATG CTGGCTACAA CTAATAAAAC CGAAAAAAGl GAATTTCTAA 180 



ATTTTTTCTA CAACCATTGT ATGCATGTTC TCACAGCACC ACTTTTGACC AATACTTCAG 240 



AAGACAAATG TGAAAAGGAT AATATAGTTG GATCAAACAA AAACAACACA ATTTGTCCCG 300 



ATAATTATCA AACAGCACAG CTACTTGCCT TAATTTTAGA GTTACTCACA TTTTGTGTGG 360 



AACATCACAC TGCTCGACTT ACA 



383 



(2) INFORMATION FOR SEO ID NO: 171: 
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(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 383 base pairs 

(B) TYPE: nucleic add 

(C) STRANOEDNESS: single 

(D) lOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 171; 

TGGGCACCTT CAATATCGCA AGTTAAAAAT AATGTTGAGT TTATTATACT TTTGACCTGT 60 

TTAGCTCAAC AG6GTGAAGG CATGTAAAGA ATGTGGACTT CTGAGGAATT HCTTTTAAA 120 

AA6AACATAA TGAAGTAACA TTTTAATTAC TCAA6GACTA CTTTTGGTTG AAGTTTATAA 180 

TCTAGATACC TCTACTTTTT GTTTTTGCT6 TTCGACAGTT CACAAAGACC TTCAGCAATT 240 

TACAGGGTAA AATCGTTGAA GTAGTGGAGG T6AAACTGAA AFTTAAAATT ATTCTGTAAA 300 

TACTATAGGG AAAGAGGCTG AGCTTAGAAT CTTTTGGTTG TTCATGTGTT CTGTGCTCTT 360 



ATCATCACAC TGCTCGACTT ACA 

(2) INFORMATION FOR SEO ID NO: 172; 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 699 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 
CD) TOPOLOGY: linear 



383 



(XT) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 
TCGGGTGATG CCTCCTCAGG CTTGTCGTTA GTGTACACAG AGCTGCTCAT 6AAGCGACAG 60 



.e72S4a6A3_IB> 
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CGGCTGCCCC TGGCACTTCA GAACCTCTTC CTCTACACTT TTGGTGCGCT TCTGAATCTA 120 

GGTCTGCATG CTGGCGGCGG CTCTGGCCCA G6CCTCCTGG AAAGTTTCTC AGGAT6GGCA 180 

GCACTCGTGG IGCTGAGCCA GGCACTAAAT GGACTGCTCA TGTCTGCTGT CATGGAGCAT 240 

GGCA6CAGCA TCACACGCCI CTTTGTGGIG TCCT6CTCGC TGGTGGICAA CGCCGTGCTC 300 

TCAGCAGTCC TGCTACGGCT GCAGCTCACA GCCGCCTTCT TCCTGGCCAC ATTGCTCATT 360 

GGCCTGGCCA TGCGCCT6TA CTATGGCAGC CGCTA6TCCC TGACAACTTC CACCCTGATT 420 

CCGGACCCTG TAGATTGGGC GCCACCACCA 6A1CCCCC1C CCAGGCCTTC CTCCCTCTCC 480 

CATCAGCGGC CCTGTAACAA GIGCCTTGTG A6AAAAGCTG GAGAAGTGAG GGCAGCCAGG 540 

TTATTCTCTG GAGGTTGGTG GATGAA6GGG lACCCCTAGG AGATGT6AAG TGTGGGTTTG 600 

GTTAAGGAAA TGCTTACCAT CCCCCACCCC CAACCAAGTT NTTCCAGACT AAAGAATTAA 660 

GGTAACATCA ATACCTAGGC CT6AGGAGGC ATCACCCGA 699 
(2) INFORMATION FOR SEQ 10 NO: 173: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 701 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:173: 
TCGGGTGATG CCTCCTCAGG CCAGATCAAA CTTGGGGTTG AAAACTGTGC AAAGAAATCA 60 
ATGTCGGAGA AAGAATTTTG CAAAAGAAAA ATGCCTAATC AGTACTAATT TAATAGGTCA 120 
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CATTAGCAGT GGAAGAAGAA ATGTTGATAT TTTATGTCAG CTATTTTATA ATCACCAGAG 
TGCTTA6CTT CATGTAAGCC ATCTCGTATT CATTAGAAAT AAGAACAATT TTATTC6TCG 
GAAAGAACTT TTCAATTTAT AGCATCTTAA TTGCTCAGCA TTTTAAATTT TGATAAAGAA 
AGCTCCACTT TTGGCAGGAG TAGGGGGCA6 GGAGAGAGGA GGCTCCATCC ACAAGGACA6 
AGACACCAG6 GCCAGTAGGG TAGCTGGTGG CTGGATCAGT CACAACGGAC TGACTTATGC 
CATGAGAAGA AACAACCTCC AAATCTCAGT TGCTTAATAC AACACAAGCT CATTTCTTGC 
TCACGTTACA TGTCCTATGT AGATCAACAG CAGGTGACTC AGGGACCCAG GCTCCATCTC 
CATATGAGCT TCCATAGTCA CCAGGACACG GGCTCT6AAA GTGTCCTCCA TGCAGGGACA 
CATGCCTCTT CCTTTCATT6 GGCAGAGCAA GTCACHATG GCCAGAAGTC ACACTGCAGG 
GCAGTGCCAT CCTGCTGTAT GCCTGAGGAG GCATCACCCG A 
(2) INFORMATION FOR SEQ ID NO: 174: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 700 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:174: 
TCGGGTGATG CCTCCTCANG CCCCTAAATC AGAGTCCA6G GTCAGAGCCA CAG6AGACAG 
GGAAAGACAT AGATTTTAAC CGGCCCCCTT CAGGAGATTC TGAGGCTCAG TTCACTTTGT 



60 
120 
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TGCAGTTTGA ACAGAGGCAG CAAGGCTAGT GGTTAGGGGC ACGGTCTCTA AAGCTGCACT 180 

GCCTGGATCT GCCTCCCAGC TCTGCCAGGA ACCAGCTGCG TGGCCTTGAG CTGCTGACAC 240 

GCAGAAAGCC CCCTGTGGAC CCAGTCTCCT CGTCTGTAAG ATGAGGACAG GACTCTAGGA 300 

I ACCCTTTCCC TTGGTTTGGC CTCACTTTCA CAGGCTCCCA TCTTGAACTC TATCTACTCT 360 

'is' 

TTTCCTGAAA CCTTGTAAAA GAAAAAAGTG CTAGCCTGGG CAACATGGCA AAACCCTGTC 420 

TCTACAAAAA ATACAAAAAT TAGTTGGGTG TGGTGGCATG IGCCTGTAGT CCCAGCCACT 480 

TGGGAGGTGC TGAGGTGGGA GGATCACTTG AGCCCGGGAG GTGGAGGTTG CAGTGAGCCA 540 

AGATCATGCC ACTGCACTCC AGCCTGAGTA ATAGAGTAAG ACTCTGTCTC AAAAACAACA 600 

ACAACAACA6 TGAGTGTGCC TCTGTTTCCG GGTTGGATG6 GGCACCACAT TTATGCATCT 660 

CTCAGATTTG GACGCTGCAG CCTGAGGAGG CATCACCCGA 700 
(2) INFORMATION FOR SEO ID NO: 175: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 484 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: Single 

(D) TOPOLOGY; linear 

>^ 

(x1) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 

I 

TATAGGGCGA ATTGGGCCCG AGTTGCATGN TCCCGGCCGC CATGGCCGCG GGATTCGGGT 60 
GATGCCTCCT CA6GCTTGTC TGCCACAAGC TACTTCTCTG AGCTCAGAAA GTGCCCCTT6 120 
ATGAGGGAAA AT6TCCTACT GCACTGCGAA TTTCTCAGTT CCATTTTACC TCCCAGTCCT 180 
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CCTTCTAAAC CAGTTAATAA ATTCATTCCA CAAGTATTTA CT6ATTACCT GCTTGTGCCA 240 

GGGACTATTC TCAGGCTGAA GAAGGTGGGA GGGGAGGGCG GAACCTGAGG AGCCACCIGA 300 

GCCAGCTTTA TATTTCAACC ATGGCTGGCC CATCTGAGAG CATCTCCCCA CTCTCGCCAA 360 

CCTATCGGGG CATAGCCCAG GGATGCCCCC AGGCGGCCCA GGTTAGATGC GTCCCTTTGG 420 

CTTGTCAGTG A1GACATACA CCTTAGCT6C TTAGCTGGTG CTGGCCTGAG GAGGCATCAC 480 

CCGA 4g4 
(2) INFORMATION FOR SEQ ID NO: 176; 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 432 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 
(0) TOPOLOGY: linear 

(XT) SEQUENCE DESCRIPTION: SEQ ID N0:176: 

TCGGGTGATG CCTCCTCAGG GCTCAAGGGA TGAGAA6TGA CTTCTTTCTG GAGGGACCGT 60 

TCATGCCACC CAGGATGAAA ATGGATAGGG ACCCACTTGG AG6ACTTGCT GATATGTTTG 120 

6ACAAATGCC AGGTAGCGGA ATT6GTACTG GTCCAGGAGT TATCCAG6AT AGATTTTCAC 180 

CCACCATGGG AC6TCATCGT TCAAATCAAC TCTTCAATGG CCATGGGGGA CACATCATGC 240 

CTCCCACACA ATCGCAGTTT GGAGAGATGG GAGGCAAGTT TATGAAAAGC CAGGGGCTAA 300 

GCCA6GTCTA CCATAACCAG AGTCAGG6AC TCTTATCCCA GCTGCAAGGA CAGTCGAA6G 360 
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ATATGCCACC TCGGTTTTCT AAGAAAGGAC AGCTIAATGC AGATGA6ATT A6CCTGAGGA A2Q 

6GCATCACCC GA 432 

(2) INFORMATION FOR SLQ ID NO: 177: 

(i) SLQULNCE CHARACTERISTICS; 
CA) LbNGTH: 788 base pairs 

(B) FYPC: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO:177: 

TA6CATGTTG AGCCCAGACA CAGTAGCATT TGTGCCAATT TCTGGTTGGA ATGGTGACAA 60 

CATGCTGGAG CCAAGTGCTA ACATGCCTTG GTTCAAGGGA TGGAAAGTCA CCCGTAAGGA 120 

TGGCAATGCC AGTGGAACCA CGCT6CTTGA GGCTCTGGAC TGCATCCTAC CACCAACTC6 180 

CCCAACTGAC AAGCCCTTGC GCCTGCCTCT CCAGGATGTC TACAAAAHG GTGGTATTGG 240 

TACTGTTCCT GTTGGCCGAG TGGAGACTGG TGTTCTCAAA CCCGGTATGG TGGTCACCTT 300 

TGCTCCAGTC AACGTTACAA CGGAAGTAAA ATCTGTC6AA ATGCACCATG AAGCTTTGAG 360 

T6AA6CTCTT CCT66GGACA ATGTGGGCTT CAATGTCAAG AAT6T6TCTG TCAAGGATGT 420 

TCGTCGTGGC AACGTTGCTG GTGACA6CAA AAATGACCCA CCAATGGAAG CAGCTGGCTT 480 

CACTGCTCAG 6T6ATTATCC TGAACCATCC AGGCCAAATA AGTGCCG6CT ATGCCCCT6T 540 

ATTGGATTGC CACACGGCTC ACATTGCATG CAAGTTTGCT GAGCTGAAGG AAAAGATTGA 600 

TCGCCGTTCT GGTAAAAAGC TGGAAGATGG CCCTAAATTC TTGAAGTCTG GTGATGCTGC 660 
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CATTGTTGAT ATGGTTCCTG GCAAGCCCAT GTGTGTTGAG AGCTTCTCAG ACTATCCACC 720 
TTTGGGTC6C TTTGCTGTTC GTGATATGAG ACAGACAGH GCGGTGGGTG TCTGGGCTCA 780 



ACATGCTA 

(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SCQUEMCE CHARACTERISTICS: 
(A) LENGTH: 786 base pairs 
CB) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



788 



Cxi) SEQUENCE DESCRIPTION; SEQ ID NO:178: 

TAGCATGTTG AGCCCAGACA CCT6TGTTTC TGG6AGCTCT GGCAGTGGCG GATTCATAGG 60 

CACTTGGGCT GCACTTTGAA TGACACACTT GGCTTTATTA 6ATTCACTAG TTTTTAAAAA 120 

ATTGTTGTTC GTTTCTTTTC ATTAAAGGTT TAATCAGACA GATCA6ACAG CATAATTTTG 180 

TATTTAATGA CAGAAACGH GGTACATTTC TTCAT6AAT6 AGCTTGCATT CTGAAGCAAG 240 

AGCCTACAAA AGGCACHGT TATAAATGAA A6TTCTGGCT CTAGA6GCCA GTACTCTG6A 300 

GTTTCAGAGC AGCCAGTGAT TGTTCCAGTC AGTGATGCCT AGTTATATAG AGGAG6AGTA 360 

CACTGTGCAC TCTTCTAGGT GTAAGGGTAT GCAACTTTGG ATCTTAAAAT TCTGTACACA 420 

TACACACTTT ATATATATGT ATGTATGTAT GAAAACATGA AATTAGTTTG TCAAATATGT 480 

GTGTGTTTAG TATTTTAGCT TAGT6CAACT ATHCCACAT TATTTATTAA ATTGATCTAA 540 
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GACACTTTCT TGTTGACACC TTGAATATTA ATGTTCAAGG GTGCAATGTG TATTCCTJT A 60G 

GATTGTTAAA GCTTAATTAC TATGATTTGf AGTAAATTAA CTTTIAAAAT GTATTTGAGC 660 

CCTTCTGTAG TGTCGTAGGG CTCTTACAGG GTGGGAAAGA TTTTAATTTT CCAGTTGCTA 720 

ATT6AACAGT ATGGCCTCAT TATATATHT GATTTATA6G A6TTTGTGTC TGGGCiCAAC 780 

ATGCTA 786 
(2) INFORMATION FOR SEQ ID NO: 179: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:179; 

TAGCATGTTG AGCCCAGACA CTGGTTACAA GACCAGACCT GCTTCCTCCA TATGTAAACA 60 

GCTTTTAAAA AGCCAGTGAA CCHHTAAT ACTTIGGCAA CCTTCTTTCA CAGGCAAAGA 120 

ACACCCCCAT CCGCCCCTTG TTTGGAGTGC AGAGTTTGGC TTT66TTCTT TGCCTTGCCT 180 

GGAGTATACT TCTAAHCCT GTTGTCCTGC ACAAGCTGAA TACC6AGCTA CCCACCGCCA 240 

CCCAGGCCAG GTTTCCACTC ATTTATTACT TTATGTTTCT GTTCCATTGC TGGTCCACAG 300 

AAATAAGTTT TCCTTTGGAG GAATGT6ATT ATACCCCTTT AATTTCCTCC THTGCTTTT 360 

TTTTAATATC ATTGGTATGT GTTTGGCCCA GAGGAAACTG AAAHCACCA TCATCTTGAC 420 

TGGCAATCCC ATTACCATGC TTTTTTTAAA AAACGTAATT TTTCHGCCT TACATTGGCA 480 
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GAGTACCCCT TCCTGGCTAC TGGCTTAATC TAGTCACTCA GTTTCTAGGT GGCATTAGGC b40 

ATGAGACCTG AAGCACAGAC TGTCTTACCA CAAAA6GTGA CAAGATCTCA AACCTTAGCC 600 

AAAGGGCTAT GTCAGGTTTC AATGCTATCT GCTTCTGTTC CTGCTCACTG ITCTGGATTT 660 

TGTCCTTCTT CATCCCTAGC ACCAGAATTT CCCAGTCTCC CTCCCTACCT TCCCTTGTTT 720 

TAAHCTAAT CTATCAGCAA AATAACTTTl CAAATGTTTT AACCGGTATC TCCATGTGTC 780 

TGGGCTCAAC ATGCTA 7gg 
(2) INFORMATION FOR 5E0 ID NO: 180: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 488 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 

GGATGTGCTG CAA6GCGATT AAGTTGGGTA ACGCCAGGGT TTTCCCAGTC ACGACGTTGT 60 

AAAACGACGG CCAGTGAATT GTAATAC6AC TCACTATAGG GCGAAHGGG CCCGACGTCG 120 

CATGCTCCC6 GCCGCCATGG CCGCGG6ATA GCATGTTGAG CCCAGACACC TGCAGGTCAT ISO 

TTGGAGAGAT TTTTCACGTT ACCAGCTTGA TGGTCTHTT CAGGAGGAGA GACACTGAGC 240 

ACTCCCAAG6 TGAGGTTGAA GATHCCTCT AGATAGCCGG ATAAGAAGAC TAGGAGGGAT 300 

GCCTAGAAAA T6ATTAGCAT GCAAATTTCT ACCTGCCATT TCAGAACTGT GTGTCAGCCC 360 
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ACATTCAGCl GCTTCTTGTG AACTGAAAAG AGA6AGGTAT TGAGACTTTT CTGATGGCCG 420 

CTCTAACATT GTAACACAGT AATCTGTGTG TGTGTGGGTG TGTGTGTGrC TCTGGGCTCA 480 

ACATGCTA 488 
C2) INFORMATION FOR SEQ ID NO: 181: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 317 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
([)) TOPOLtXiY: linear 



(x1) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 

TAGCATGTTG AGCCCAGACA C6GCGACGGT ACCTGATGAG TGGGGTGATG 6CACCTGTGA 60 

AAAGGAGGAA CGTCATCCCC CATGATATTG GGGACCCAGA TGATGAACCA TGGCTCCGCG 120 

TCAATGCATA TTTAATCCAT GATACTGCTG ATTGGAAGGA CCTGAACCTG AAGTTT6TGC 180 

TGCAGGTTTA TCGGGACTAT TACCTCACGG 6TGATCAAAA CTTCCTGAAG GACATGT66C ?40 

CTGTGTGTCT AGTAAGGGAT GCACATGCAG TGGCCAGTGT GCCAGGGGTA TGGTTG6TGT 300 

CTGGGCTCAA CATGCTA 317 
(2) INFORMATION FOR SEQ ID NO: 182: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 507 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1B2: 

lAGCATGTTG AGCCCAGACA CTGGCTGTTA GCCAAATCCT CTCTCAGCTG CTCCCTGTGG 60 

TTTGGTGACT CAG6ATTACA GAGGCATCCT GTTTCAGGGA ACAAAAAGAT TTTAGCTGCC 120 

AGCAGAGAGC ACCACATACA TTAGAATGGT AAGGACTGCC ACCTCCTTCA AGAACAGGAG 180 

TGAGGGTGGT GGT6AATGGG AATGGAAGCC TGCATTCCCT GATGCATTTG TGCTCTCTCA 240 

AATCCTGTCT TAGTCTTAGG AAAGGAAGTA AAGTTTCAAG GACGGTTCCG AACTGCTTTT 300 

TGTGTCT6GG CTCAACATGC TATCCCGCGG CCATGGCGGC CGGGAGCATG CGACGTCGGG 360 

CCCAATTCGC CCTATAGT6A GTCGTATTAC AATTCACTGG CCGTCGTTTT ACAACGTCGT 420 

GACTGGGAAA ACCCTGGCGT TACCCAACTT AATCGCCTTG CAGCACATCC CCCTTTCCCA 480 

GCTGGCGTAA TANCGAAAAG GCCC6CA 507 
(2) INFORMATION FOR SEO ID NO: 183: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(x1) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 
GATTTACGCT GCAACACTGT GGAGGTAGCC CTGGAGCAAG GCAGGCATGG ATGCTTCTGC 60 
AATCCCCAAA T66AGCCT6G TATTTCAGCC AGGAATCTGA GCAGAGCCCC CTCTAATTGT 120 
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AGCAATGATA AGTTATTCTC TnGTTCTTC AACCTTCCAA TAGCCTTGAG CTTCCAGGGG 180 
AGTGTCGTTA ATCATTACAG CCTGGTCTCC ACAGTGTTGC AGCGTAA 271 
(2) INFORMATION FOR SEQ ID N0:184: 

(1) SEQUENCE CHARACTERISTICS: 

CA) LENGTH: 225 base pairs 

CB) TYPE: nucleic add 
CO STRANDEDNESS: single 
CD) lOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 

TTACGCTGCA ACACTGTGGA 6CA6ATTAAC ATCAGACTTT TCTATCAACA TGACTGGGGT 60 

TACTAAAAAG ACAACAAATC AATGGCTTCA AAAGTCTAAG GAATAATTTC GATACTTCAA 120 

CTTTATAAAA CCT6ACAAAA CTATCAATCA AGCATAAAGA CAGAT6AAGA ACATTTCCAG 180 

ATTTTGGCCA ATCAGATATT TTACCTCCAC AGTGTTGCAG CGTAA 225 
(2) INFORMATION FOR SEQ ID NO: 18b: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 597 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(x1) SEQUENCE DESCRIPTION: SEQ ID NO: 185: 
GGCCCGACGT CGCATGCTCC CGGCCGCCAT G6CCGCGGGA TTCGTTAGGG TCTCTATCCA 60 
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CTGGGACCCA TAGGCTAGTC AGAGTATTTA GAGTTGAGTT CCTTTCTGCT TCCCAGAATT 120 
TGAAAGAAAA GGAGTGAGGT GATAGAGCTG AGAGATCAGA TTTGCdCTG AAGCCTGTTC 180 

AAGATGTATG TGCTCAGACC CCACCACTGG GGCCTGTGGG TGAGGTCCTG GGCATCTATT 240 

TGAATGAATT GCTGAAGGGG AGCACTATGC CAAGGAAGGG GAACCCATCC TGGCACTGGC 300 

ACA6GG6TCA CCTTA1CCAG TGCTCAGTGC TTCHTGCTG CIACCTGGTT TTCTCTCATA 360 

TGTGAGGGGC A6GTAA6AAG AAGTGCCCRG T6TTGTGCGA GTTTTAGAAC ATC7ACCAG1 420 

AAGTGGGGAA GTTTCACAAA GCAGCAGCTT TGTTTTGTGT ATTTTCACCT TCAGTTAGAA 480 

GAGGAAGGCT GTGAGATGAA TGTTAGTTGA GTGGAAAAGA CGGGTAAGCT TAGTGGATAG 540 

AGACCCTAAC GAATCACTAG TGCGGCCGCC TTGCAGGTCG ACCATATGGG AGAGCTC 597 
(2) INFORMATION FOR SEQ ID NO: 186: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 597 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ 10 N0:186: 

6GCCCGAAGT TGCATGTTCC CGGCCGCCAT GGCCGCGGGA TTCGTTAGGG TCTCTATCCA 60 

CTACCTAAAA AATCCCAAAC ATATAACTGA ACTCCTCACA CCCAATTGGA CCAATCCATC 120 

ACCCCAGAGG CCTACAGATC CTCCTTTGAT ACATAA6AAA AHTCCCCAA ACTACCTAAC 180 
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TATATCATTT TGCAAGATTT GTTTTACCAA ATTTTGATGG CCTTTCTGAG CTTGTCAGTG 240 
TGAACCACTA TTACGAACGA TC6GATATTA ACTGCCCCTC ACCGTCCAGG TGTAGCTGGC 300 
AACATCAAGT GCAGTAAATA TTCATTAAGT TTTCACCTAC TAA6GTGCTT AAACACCCTA 360 

I 

I GGGTGCCATG TCGGTAGCAG ATCTTTTGAT TTGTTTTTAT TFCCCATAAG GGTCCTGTTC 420 

AAGGICAATC ATACATGTAG IGTGAGCAGC TAGTCACTAT CGCATGACTT GGAGGGTGAT 480 
AATAGAGGCC TCCTTTGCTG TIAAAGAACT CTTGTCCCAG CCTGTCAAAG TGGATAGA6A 540 
CCCTAAC6AA TCACTAGT6C GGCCGCCT6C AGGTCGACCA TAT6GGAGAG CTCCCAA 597 
(2) INFORMATION FOR SEQ ID NO: 187: 

I 

■ (1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 324 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CO) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 

TCGTTAGGGT CTCTATCCAC TT6CAGGTAA AATCCAATCC TGTGTATATC TTATAGTCTT 60 

CCATATGTAG TGGTTCAAGA GACTGCAGTT CCA6AAAGAC TAGCC6AGCC CATCCATGTC 120 

TTCCACTTAA CCCTGCTTTG GGTTACACAT CTTAACTTTT CTGTTCAAGT TTCTCTGTGT 180 

AGTTTATAGC ATGAGTATTG GGAWAATGCC CTGAAACCTG ACATGA6ATC TGGGAAACAC 240 

AAACTTACTC AATAAGAATT TCTCCCATAT TTTTAT6ATG GAAAAATTTC ACATGCACAG 300 

AGGAGTGGAT AGAGACCCTA ACGA 324 
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(2) INFORHATION FOR SEQ ID NO: 188: 

(i) SCOUlNCL CHARACTERISTICS: 

(A) LENGTH: 178 base pairs 

(B) TYPE: nucleic acid 

(C) SIRANDEDNESS: single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 



GCGCGGGGAT TCGGGGTGAT ACCTCCTCAT GCCAAAATAC AACGTNTAAT TTCACAACTT 60 



■J 



GCCTTCCAAT TTACGCATTT TCAATTTGCT CTCCCCATTT GTTGAGTCAC AACAAACACC 120 



ATTGCCCAGA AACATGTATT ACCTAACATG CACATACTCT TAAAACTACT CATCCCTT 



178 



(2) INFORMATION FOR SEQ ID NO: 189: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 367 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(x1) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 



TGACACCTTG TCCAGCATCT GACACAGTCT TGGCTCTTGG AAAATATTGG ATAAATGAAA 60 



ATGAATTTCT TTAGCAAGTG GTATAAGCTG AGAATATACG TATCACATAT CCTCATTCTA 120 



AGACACATTC AGT6TCCCT6 AAATTAGAAT AGGACTTACA ATAAGTGTGT TCACTHCTC 180 



AATAGCTGTT ATTCAATT6A TGGTAGGCCT TAAAAGTCAA AGAAATGAGA GGGCATGTGA 240 
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AAAAAAGCTC AACATCACTG ATCATTAGAA AACTTCCATI CAAACCCCCA ATGAGATACC 300 

ATCTCATACC AGTCAGAATG GCTATTATTA AAAAGTCAAA AAATAACAGA TGCTGGACAA 360 

GGTGTCA 367 
(2) INFORMATION FOR SEQ ID NO: 190: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 369 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: Single 

(D) TOPOLOGY: linear 

(x1) SEQUENCE DESCRIPTION: 5E0 ID NO: 190: 

GACACCTTGT CCAGCATCTG ACAACGCIAA CAGCCT6AGG AGATCHTAT HATTTATTT 60 

AGTTTTTACT CTGGCTAGGC AGATGGTGGC TAAAACATTC ATHACCCAT TTATTCATTT 120 

AATTGTTCCT GCAAGGCCTA TGGATAGAGl ATIGTCCAGC ACTGCTCTGG AAGCTAGGAG 180 

CATGGGGATG AACAAGATAG GCTACATCCT GTTCCCACAG AACTTCCACT TTAGTCTGGG 240 

AAACAGATGA TATATACAAA TATATAAATG AATTCAGGTA GTTTTAAGTA CGAAAAGAAT 300 

AAGAAAGCAG AGTCATGATT TANAATGCTG GAAACAGGGG CTATTGCTTG AGATATTGAA 360 

GGTGCCCAA 369 

(2) INFORMATION FOR SEQ ID NO: 191: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 369 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(XI) SLQUENCF DESCRIPTION: SEC ID NO: 191: 
FGACACCTTG TCCAGCATCT GCACAGGGAA AAGAAACTAT TATCAGAGTG AACAGGCAAC 
CTACAGAATG GGAGAAAATT TTTGCAATCl ATCCATCT6A CAAAGGGCTA ATATCCAGAA 
TCTACAAAGA ACTTATACAA ATTTACAAGA AACAAACAAA CAAACAACTC CTCAAAAAGT 
GGGT6AAGGA TGTGAACAGA CACTTCTCAA AAGAAGACAT TTATGGGGCC AACAAACATA 
TGAAAAAAAG CTCATCATCA CTGGTCACTA 6ATAAATGCA AATCAAAACC ACAATGA6AT 
ACCATCTCAT TCCAGTTAGA ATGGCAATCA TTAAAAAGTC AGGAAACAAC AGATGCTGGA 



CAAGGTGTC 

(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 449 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 



369 



(x1) SEQUENCE DESCRIPTION: SEQ ID N0:192: 
TGACGCTTGG CCACTTGACA CHCATCTTT GCACAGAAAA ACTTCTTTAC AGATTTAATT 
CAAGACTGGT CTAGTGACAG TCCTCCA6AC ATTTTTTCAT TTGTTCCATA TACGTGGAAT 
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TTTAAAATCA TGTTTCATCA GTTTGAAAT6 ATTTGGGCTG CTAATCAACA CAATTGGATC 180 



6ACTGTTCTA CTAAACAACA GGAAAATGTG TATCTGGCAG CCT6TGGAGA AACACTAAAC 240 



ATTGATTTTT CTTTGCCTTT TAC6GACTTT GTTCCAGCTA CATGTAATAC CAAGTTCTCT 300 



TTAAGAGGAG AAGATGTTGA TCTTCATTTG TTTCTACCAG ACTGCCACCC TAGTAAA1AT 360 



TCTTTATTTA TGCTG6TAAA AAATTGCCAT CCAAATAAGA TGATTCATGA TACTGGTATT 4?0 



CCTGCTGAGT GTCAAGTGGC CAAGCGTCA 



449 



(2) INFORMATION FOR SEQ ID NO: 193; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:193: 



TGACGCTTGG CCACTTGACA CCAGGGATGT AKCAGHGAA TATAATCCTG CAATTGTACA 60 



TATTGGCAAT TTCCCATCAA ACATTCTAGA AAGAGACAAC CAGGATTGCT AGGCCATAAA 120 



AGCTGCAATA AATAACTGGT AATTGCAGTA ATCATTTCAG GCCAAHCAA TCCAGTTTGG 180 



CTCAGAGGTG CCHTGGCTG AGAGAAGAGG T6AGATATAA TGTGTTHCT TGCAACTTCl 240 



TGGAAGAATA ACTCCACAAT AGTCTGAGGA CTAGATACAA ACCTATTTGC CATTAAAGCA 300 



CCAGAGTCTG TTAATTCCAG TACTGAIAAG TGTTGGAGAT TA6ACTCCAG TGTGTCAAGT 360 



GGCCAAGCGT CA 



372 
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(2) INFORMATION FOR SEQ ID HO.W: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 309 base pairs 
(R) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 



(x1) SEQUENCE DESCRIPTION: SEQ ID N0:194: 



TGACGCTTGG CCACTTGACA CTTATGTAGA ATCCATCGT6 GGCTGATGCA AGCCCTTTAl 60 



TTAGGCTTAG TGTTGT6GGC ACCTTCAATA TCACACTA6A GACAAACGCC ACAAGATCTG 120 



CAGAAACATT CAGTTCT6AN CACTCGAATG GCAGGATAAC TTTTTGT6TT GTAATCCTTC 180 



ACATATACAA AAACAAACTC TGCANTCTCA CGTTACAAAA AAACGTACTG CTGTAAAATA 240 



TTAAGAAGGG GTAAAGGATA CCATCTATAA CAAAGTAACT TACAACTAGT 6TCAAGTGGC 300 



CAAGCGTCA 



309 



(2) INFORMATION FOR SEQ ID NO: 195: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 



(x1) SEQUENCE DESCRIPTION: SEQ ID N0:195: 



TGACGCTTGG CCACTTGACA CCCAATCTCG CACTTCATCC TCCCAGCACC T6AT6AA6TA 60 
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GGACTGCMC TATCCCCACT TCCCAGATGA GGGGACCAAN GtACACATTA GGACCCGGAT 120 

GGGAGCACAG ATTTGTCCGA TCCCAGACTC CAAGCACTCA GCGTCACTCC AGGACAGCGG 180 

CTTfCAGATA AGGTCACAAA CATGAATGGC TCCGACAACC GGAGTCAGTC CGTGCTGAGT 2/)0 

TAAGGCAATG GTGACACGGA TGCACGTGTN ACCTGTAATG GTTCATCGTA AGTGTCAAGT 300 
GGCCAAGCGT CA 312 
(2) INFORMATION FOR SEQ ID NO: 196: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 288 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: Single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196: 

TGTATCGAC6 TAGTGGTCTC CTCAGCCATG CAGAACTGTG ACTCAATTAA ACCTCHTCC 60 

TTTATGAATT ACCCAATCTC GGGTAGTGTC TTTATAGTAG TGTGAGAATG GACTAATACA 1?0 

AGTACATTTT ACTTAGTAAT AATAATAAAC AAATATATTA CATTTTTGTG TATTTACTAC 180 

ACCATATTTT TTATTGTTAT TGTAGTGTAC ACCTTCTACT TATTAAAAGA AATAGGCCCG 240 

AGGCGG6CAG ATCACGAGGT CAGGAGATGG AGACCACTAC GTCGATAC 288 

(2) INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 289 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY; linear 

Cxi) SEQUENCE DESCRIPTION: SEQ 10 NO: 197: 

TTGGGCACCT TCAATATCAT GACAGGTGAT GTGATAACCA AGAAG6CTAC TAAGTGATTA 60 

ATGGGTGG6T AATGTATACA GAGTAG6TAC ACTGGACAGA 6GGGTAATTC ATAGCCAAG6 120 

CAGGAGAAGC AGAATGGCAA AACATTTCAT CACACTACTC AGGATAGCAT GCAGTTTAAA 180 

ACCTATAAGT AGTTTATTTT TGGAATTTTC CACTTAATAT TTTCAGACTG CA6GTAACTA 240 

AACTGTGGAA CACAAGAACA TAGATAA6GG GAGACCACTA CGTCGATAC 289 
(2) INFORMATION FOR SEO ID N0:198: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 288 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS; single 
CD) TOPOLOGY: linear 

(x1) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 

GTATCGACGT AGTGGTCTCC CAAGCAGTGG 6AAGAAAACG TGAACCAATT AAAATGTATC 60 

AGATACCCCA AAGAAAGGCG CTTGAGTAAA GATTCCAAGT GGGTCACAAT CTCAGATCTT 120 

AAAATTCAGG CTGTCAAAGA GATHGCTAT 6AGGTTGCTC TCAATGACTT CAGGCACAGT 180 

CGGCAGGAGA TT6AAGCCCT GGCCATTGTC AAGATGAAGG AGCTTTGTGC CAT6TATGGC 240 
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AAGAAAGACC CCAATGAGCG GGACTCCTGG AGACCACTAC GTCGATAC 



288 



(2) INFORMATION FOR SEQ ID NO: 199: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1027 base pairs 

(B) TYPE: nuclCK acid 

(C) STRANDEONESS: single 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 



GCTTTnCGG AAAAACNCAA NT6GGGGAAA 6GGGGNTTNN TNGCAAGGG6 ATAAAGGGG6 60 



AANCCCAGGG TTTCCCCATI CAGGGAGGTC TAAAAAGNCG GCCAGG6GAT TGTAANAGGA 120 



TTCAATAATA GGGGGAATGG GCCCNGAAGT TGCAAGGTTC CNGCCCGCCA TGNCCGCGGG 180 



ATTTAGTGAC ATTACGACGS TGGTAATAAA GTGGGSCCAA WAAATATTTG TGATGTGATT 240 



TTTSGACCAG TGAACCCATT GWACAGGACC TCATTTCCTY TGAGATGRTA GCCATAATCA 300 



GATAAAAGRT TAGAAGTYTT TCTGCACGTT AACAGCAKA TTAAATGGAG TGGCATCACC 360 



AATTTCACCC TTTGTTAGCC GATACCTTCC CCTTGAAGGC ATTCAATTAA GTGACCAATC 420 



GTCATACGAG AGGGGAIGGC ATGGGGATTG ATGATGATAT CAGGGGTGAT ACCTTCACAG 480 



GT6AAAGGCA TATCCTCTTG TCTATACTGA ATACCACAAG TACCCTTTTG ACCATGTCGA 540 



CTAGCAAATT TGTCTCCAAT CTGTGTWATC CCTAACAGAG CGTACCCTTA TTTTACAAAA 600 



TTTATATCCT TCCT6ATTGA GAGTTACCAT AACCT6ATCC ACAATGCCCG TCTCGCTWGT 660 



TCTGAGAAAA GTGCTACAGT CTCTCTT6GT ATAGCGTCTA TTGGTGCTCT CCAATTCATC 720 



NSDOCID: <WO__B7254aeA3JB> 



I 



I 



wo 97725426 PCT/US97/00485 

169 

TTCATTTTTC AGGCMGGT6 AACTGTTTT6 CCTAIAATAA CMTCATCTCC TGATACMC6A 780 

AACCCCKGGA RCTATCAAAC CATCATCATC CAGC6TTCKT WATGTYMCTA AATCCCTATT 840 

GC6GCCGCCT GCAGGTCAAC ATATN6GAAA ACCCCCCACC CCTTN6GAGC NTACCTTGAA 900 

TTTTCCATAT GTCCCNTAAA TTANCTNGNC TTANCCTGGC CNTAACCTNT TCCGGTTTAA 960 

ATTGTTTCCG CCCCCNTTCC CCNCCTTNNA ACCGGAAACC TTAATTTTNA ACCNGGGGTT 1020 

CCTATCC ^P27 
(?) INFORMATION FOR SEQ [D N0:200: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: sing]e 

(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO:200: 

AGIGACATTA CGACGCTGGC CATCTTGAAT CCTAG66CAT GAAGTTGCCC CAAAGTTCAG 60 

CACTTGGTTA AGCCTGATCC CTCTGGTTTA TCACAAAGAA TAG6ATGGGA TAAAGAAAGT 120 

GGACACTTAA ATAAGCTATA AATTATATGG TCCTTGTCTA GCAGGAGACA ACTGCACAGG 180 

TATACTACCA GCGTCGTAAT GTCACTA 207 

(2) INFORMATION FOR SEQ ID N0:201: 

(i) SEQUENCE CHARACTERISTICS: 
CA) LENGTfl: 209 base pairs 
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(B) TYPC: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY; linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:201: 

TGGGCACCTT CAATATCTAT TAAAAGCACA AATACTGAA6 AACACACCAA GACTATCAAT 60 

GAGGriACAT CTGGAGTCCT CGATATATCA G6AAAAAATG AAGTGAACAT TCACAGAGTT 120 

TTACTTCTTT G6GAACTCAA ATGCTAGAAA AGAAAAGGGT GCCCTCTTTC TCTGGCTTCC 180 



TGGTCCTATC CAGCGTCGTA ATGTCACTA 
(2) INFORMATION FOR SEO ID N0:202: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 349 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



209 



(XI) SEQUENCE DESCRIPTION: SEO ID NO:202: 

NTACGCTGCA ACACTGTGGA GCCACTG6TT HTATTCCCG GCAGGHATC CAGCAAACAG 60 

TCACTGAACA CACCGAAGAC CGTGGTATGG TAACC6TTCA CAGTAATCGT TCCAGTCGTC 120 

TGCGGGACCC CGACGA6CGT CACTGGGTAC AGACCAGAU CAGCCGGAAG AGAAAGCGCC 180 

GCA6GGAGAG ACTCGAACTC CACTCCGCTG GTGAGCAGCC CCATGTTTTC AACTC6AA6T 240 

TCAAACGGCA HGGGTTATA TACCATCAGC TGAACTTCAC ACACATCTCC TTGAACCCAC 300 
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TGGAAATCTA TTTTCTTGTT CCGCTCTTCT CCACAGTGTT GCA6CGTAA 3^9 
{2) INFORMATION FOR SEQ ID NO:203: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 241 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 



(x1) SEQUENCE DESCRIPTION: SEQ ID N0:203: 

TGCTCCTCTT GCCTTACCAA CCCAAAGCCC ACT6T6AAAT AT6AAGTGAA T6ACAAAATT 60 

CAGTTTTCAA CGCAATATAG TATAGTTTAT CTGATTCTTT TGATCTCCAG 6ACACTTTAA 120 

ACAACTGCTA CCACCACCAC CAACCTAG6G ATTTAGGATT CTCCACAGAC CAGAAATTAT 180 

TTCTCCTTTG AGTTTCAGGC TCCTCTGGGA CTCCTGTTCA TCAATGGGTG GTAAATGGCT 240 

^ 241 
(2) INFORMATION FOR SEQ ID N0:?04: 

(1) SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 248 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:204; 
TAGCCATTTA CCACCCATCT GCAAACCSWG ACMWWCARGR CYWGWACKYA GGCGAHTGA 60 
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AGTACTGGTA ATGCTCTGAT CATGTTAGTT ACATAAGT6T GGTCAGHTA CAAAAATTCA 120 

CA6AACTAAA TACTCAATGC TATGTGTTCA TGTCTGTGTT TAT6TGTGTG TAATGTTTCA 180 

ATTAAGTTTT TTTAAAAAAA AGAGATGATT TCCAAAfAAG AAAGCCGTGi TGGTAAGGCA 2^0 

AGAGGAGC 248 
(2) INFORMATION FOR SEQ ID N0:205: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 505 base pairs 

(B) TYPE: nucleic ^c^6 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(x1) SEQUENCE DESCRIPTION: SEQ ID N0:205: 

TACGCTGCAA CACTGTGGAG CCATTCATAC AG6TCCCTAA TTAAGGAACA AGTGATTATG 60 

CTACCTTTGC ACGGTTAGGG TACCGCGGCC GTTAAACATG TGTCACTGG6 CAG6CGGTGC 120 

CTCTAATACT GGTGATGCTA GAG6TGATGT TTTTGGTAAA CAGGCGGGGT AAGATTTGCC 180 

GAGTTCCTTT TACTTTTTTT AACCTTTCCT TAT6AGCATG CCTGTGTTGG 6TTGACAGTG 240 

GGGGTAATAA TGACTTGTT6 GTTGATTGTA GATATT6GGC T6TTAATTGT CAGTTCA6TG 300 

TTTTAATCTG ACGCAGGCTT ATGCGGAGGA GAATGTTTTC ATGHACTTA TACTAACATT 360 

AGTTCTTCTA TAGGGTGATA GATTGGTCCA ATTGGGTGTG AGGAGTTCAG TTATATGTTT 420 

GGGATTTTTT AG6TAGTGGG TGTTGANCTT GAACGCTTTC TTAATTGGTG GCTGCTTTTA 480 

R6CCTACTAT GGGTGGTAAA TGGCT 50b 
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(2) INFORMATION FOR SEO ID N0:206: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 179 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO:206: 
TAGACTGACT CATGTCCCCT ACCAAAGCCC ATGTAAG6AG CTGAGTTCTT AAAGACTGAA 
GACAGACTAT TCTCTG6AGA AAAATAAAAT GGAAATTGTA CTTTAAAAAA AAAAAAAATC 
GGCCGGGCAT GGTAGCACAC ACCTGTAATC CCAGCTACTA GGG6ACATGA GTCAGTCTA 
(2) INFORMATION FOR SEQ ID NO: 207: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 176 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:207: 
AGACTGACTC ATGTCCCCTA CCCCACCTTC TGCTGTGCTG CCGTGHCCT AACAGGTCAC 
AGACTGGTAC TGGTCAGTGG CCTGGGGGH GGGGACCTCT ATTATATGGG ATACAAATTT 
AGGAGTTGGA ATTGACACGA TTTAGT6ACT GATGG6ATAT GGGTG6TAAA TGGCTA 
(2) INFORMATION FOR SEQ ID N0:208: 
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(i) SEQUENCt CHARACTERISTICS: 
(A) LENGTH: 196 base pairs 
(f3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:208: 

AGACTGACTC ATGTCCCCTA TTTAACAGGG TCTCTAGTGC TGTGAAAAAA AAAAATGCTG 60 

AACATTGCAT ATAACTTATA TTGTAAGAAA TACTGTACAA TGACTTTATT GCATCTGGGT 120 

AGCTGTAAGG CATGAA6GAT GCCAAGAAGT TTAAGGAATA TGGGTGGTAA ATGGCTAGGG 180 



GACAT6AGTC AGTCTA 
(2) INFORMATION FOR SEQ ID N0:209: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: Single 
(0) TOPOLOGY: linear 



196 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:209: 

GACGCTTGGC CACTT6ACAC CTHTATTTT TTAA6GATTC TTAAGTCATT TANGTNACTT 60 

TGTAAGTTTT TCCTGTGCCC CCATAAGAAT GATAGCTTTA AAAATTATGC TGGGGTAGCA 120 

AAGAAGATAC TTCTAGCTTT AGAATGTGTA GGTATA6CCA GGATTCTTGT GAGGAGGGGT 180 

GATTTAGAGC AAATTTCTTA TTCTCCTT6C CTCATCTGTA ACATGGGGAT AATAATAGAA 240 
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CTGGCTTGAC AAG8TTGGAA TTAGTATTAC ATGG7AAATA CATGTAAAAT GTTTAGAATG 300 
GTGCCAAGTA TCTAGGAAGT ACTTGGGCAT GGGTGGTAAA TGGCT 34(5 
(2) INFORMATION FOR SEQ ID N0:210: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 178 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(x1) SEQUENCE DESCRIPTION: SEQ 10 N0:210: 
GACGCTTGGC CACTTGACAC TAGAGTAGGG TTTGGCCAAC TTTTTCTATA AAGGACCAGA 60 
GAGTAAATAT TTCAGGCTTT GTGGGTTGTG CAGTCTCTCT TGCAACTACT CAGCTCTGCC 120 
ATTGTA6CAT AGAAATCAGC CATAGACAGG ACAGAAATGA ATG6GTGGTA AATG6CTA 178 
(2) INFORMATION FOR SEQ ID N0;2I1: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEO ID N0:211: 
TGGGCACCTT CAATATCTAT CCAGCGCATC TAAATTCGCT TTHTCTTGA TIAAAAATTT 60 
CACCACTTGC TGTTTHGCT CAT6TATACC AAGTAGCAGT GGTGTGAGGC CATGCTTGTT 120 
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rTHGATTCG ATATCAGCAC CGTATAAGAG CAGTGCTTTG GCCATTAATT TATCTTCATT 180 



GTAGACAGCA TAGTGTAGAG TGGTATCTCC ATACTCATCT GGAATATTTG GATCAGTGCC 240 



ATGTTCCAGC AACATIAACG CACAHCATC TTCCTGGCAT TGTACGGCCl TTGTCAGAGC 300 



TGTCCTCTTT TTGTTGTCAA GGACATTAAG TTGACATCGT CTGTCCAGCA CGAGTTTTAC 360 



TACTTCTGAA TTCCCATTGG CAGA6GCCAG ATGTAGAGCA 6TCCTCTTTT GCTTGTCCCT 4?0 



CTTGTTCACA TCAGTGTCCC TGAGCATAAC GGAA 



454 



(2) INFORMATION FOR SEO ID N0:212; 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 base pairs 

(B) TYPE: oucIeK acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:212: 



TCCGTTAT6C CACCCAGAAA ACCTACT6GA GTTACTTATT AACATCAAGG CTGGAACCTA 60 



TTTGCCTCAG TCCTATCTGA TTCATGAGCA CATGGTTATT ACT6ATCGCA TTGAAAACAT 120 



TGATCACCTG GGTTTCTTTA TTTATCGACT GTGTCATGAC AAGGAAACTT ACAAACTGCA 180 



ACGCAGAGAA ACTATTAAAG GTATTCAGAA ACGTGAAGCC AGCAATTGTT TC6CAATTCG 240 



GCATTHGAA AACAAATTTG CCGTGGAAAC TTTAATHGT TCTTGAACAG TCAAGAAAAA 300 



CATTATTGAG GAAAATTAAT ATCACAGCAT AACGGAA 



337 
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(2) INFORMATION FOR SEQ ID N0:213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 715 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID N0:213: 

TCGGGTGATG CCTCCTCAGG CATCTTCCAT CCATCTCTTC AAGATTA6CT GTCCCAAATG GO 

TTTTTCCTTC TCTTCHTAC TGATAAATTT GGACTCCTTC TTGACACTGA TGACAGCHT 120 

AGTATCCTTC TTGTCACCTT GCAGACTTTA AACATAAAAA TACTCATTGG TTTTAAAAGG 180 

AAAAAAGTAT ACATTAGCAC TATTAAGCTT G6CCTTGAAA CATTTTCTAT CTTTTATTAA 240 

ATGTCGGTTA GCTGAACAGA ATTCATTTTA CAATGCA6AG TGAGAAAAGA AGGGAGCTAT 300 

ATGCATTTGA GAATGCAAGC ATTGTCAAAT AAACAHTTA AATGCTTTCT TAAAGTGAGC 360 

ACATACAGAA AlACATTAAG ATATTAGAAA GTGTTTHGC TTGTGTACTA CTAATTAGGG 420 

AAGCACCTTG TATAGTTCCT CHCTAAAAT TGAAGTAGAT TTTAAAAACC CATGTAATTT 480 

AATTGAGCTC TCAGTTCAGA TTTTAGGAGA ATTTTAACAG GGATTTGGTT TTGTCTAAAT 540 

TTTGTCAATT TNTTTAGTTA ATCTGTATAA TTTTATAAAT 6TCAAACTGT ATTTA6TCC6 600 

TTTTCATGCT GCTATGAAAG AAATACCCAN GACAGGGTTA TTTATAAAN6 GAAAGANGTT 660 

AATTTGACTC CCAGTTCACA GGCCTGA6GA NGNATCNCCC 6AAATCCTTA TTGCG 715 
(2) INFORMATION FOR SEQ ID N0:214: 
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(i) SlQUENCE CHARACTERISTICS; 

(A) LENGTH: 345 base pairs 
4 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
1 (D) TOPOLOGY: linear 

I 

(xi) SLQUENCE DESCRIPTION: SEO ID NO: 214: 

GGTAANGNGC AlACNTCGGl GCTCCGGCCG CCGGAGTCGG GGGATTCGGG TGATGCCTCC 60 

TCAGGCCCAC TTGGGCCTGC TTTTCCCAAA TGGCAGCTCC TCTGGACATG CCATTCCTTC 120 

TCCCACCTGC CTGATTCTTC ATATGTTGGG TGTCCCTGTT TTTCTGGTGC TATTTCCTGA 180 

V CTGCTGTTCA GCTGCCACTG TCCTGCAAAG CCTGCCTTTT TAAATGCCTC ACCATTCCTT 240 

CATTTGTTTC TTAAATATGG GAAGTGAAAG TGCCACCTGA GGCCGGGCAC A6TGGCTCAC 300 

GCCT6TAATC CCAGCACTH 6GGAGCCTGA G6A6GCA7CA CCCGA 345 
(2) INFORMATION FOR SEO ID NO: 215: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 429 base pairs 

(B) TYPE: nucleic acid 

4 (C) STRANDEDNESS: sinqle 

CD) TOPOLOGY: linear 

I 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:215: 
GGTGATGCCT CCTCAGGCGA AGCTCAGGGA GGACAGAAAC CTCCCGTGGA GCA6AAGGGC 60 
. AAAAGCTCGC TTGATCTTGA HTTCAGTAC GAATACAGAC CGT6AAAGCG GGGCCTCACG 120 
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ATCCTTCTGA CCTTTTGGGT TTTAAGCAGG AGGT6TCAGA AAAGHACCA CAGGGATAAC 180 

TGGCTTGTGG CGGCCAA6CG TTCATAGCGA CGTCGCTTTT TGATCCTTCG ATGTCGGCTC 2^10 

TTCCTATCAT TGTGAAGCAG AATTCACCAA GCGTTGGATT GTTCACCCAC TAATAGGGAA 300 

CGTGAGCTGG GTTTAGACCG TC6TGAGACA GGTTAGTTTT ACCCTACTGA T6ATGTGTKG 360 

TTGCCATGGT AATCCTGCTC AGTACGA6AG GAACCGCAGG TTCASACATT TGGTGTATGT 420 

GCTTGCCTT 429 
(2) INFORMATION FOR SEQ ID N0:216; 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 593 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Cxi) SEQUENCE DESCRIPTION: SEQ ID N0:216: 

TGACACCTAT GTCCNGCATC TGTTCACAGT TTCCACAAAT AGCCAGCCTT TGGCCACCTC 60 

TCTGTCCT6A GGTATACAAG TATATCAGGA GGTGTATACC TTCTCTTCTC TTCCCCACCA 120 

AAGAGAACA1 GCAG6CTCTG 6AA6CTGTCT TAG6AGCCTT TGGGCTCAGA ATTTCAGAGT 180 

CTTGGGTACC TTGGATGTGG TCTGGAAGGA GAAACATTG6 CTCTGGATAA GGAGTACAGC 240 

CGGAGGAGGG TCACAGAGCC CTCAGCTCAA GCCCCTGTGC CTTAGTCTAA AAGCAGCTTT 300 

GGATGAGGAA GCAGGTTAAG TAACATACGT AAGCGTACAC AGGTAGAAAG TGCTGGGAGT 360 
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CA6AATTGCA CAGT6TGTAG GAGTAGIACC TCAATCAATG AG6GCAAATC AACTGAAAGA 420 

AGAAGACCNA TIAATGAATT GCTTANG6GG AAG6ATCAAG GCTATCATG6 AGATCTTTCT 480 

AGGAAGAT1A TTGTTTANAA TTATGAAAGG ANTA66GCAG 6GACAGGGCC AGAAGTANAA 540 

i 

I GANAACATTG CCTATANCCC TT6TCTTGCA CCCAGATGCT GGACAAGGTG TCA 593 



i 



(2) INFORMATION FOR SEQ ID N0:217: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 335 base pairs 
(D) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION; SEQ ID N0;217: 

TGACACCTTG TCCAGCATCT GACGTGAAGA TGAGCAGCTC AGAGGAGGTG TCCTGGATTT 60 

CCTGGHCTG TGGGCTCCGT GGCAATGAAT TCTTCTGTGA AGTG6ATGAA GACTACATCC 120 

AGGACAAATT TAATCTTACT GGACTCAATG AGCAGGTCCC TCACTATCGA CAAGCTCTAG 180 

ACATGATCTT GGACCTGGAG CCTGATGAAG AACTGGAA6A CAACCCCAAC CAGAGTGACC 240 

TGATTGAGCA GGCAGCCGAG ATGCTTTATG GATTGATCCA CGCCCGCTAC ATCCTTACCA 300 

ACCGTGGCAT C6CCCAGATG CTGGACAAGG TGTCA 335 
(2) INFORMATION FOR SEQ ID N0:2I8; 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 248 base pairs 

(B) TYPE; nucleic acid 
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(C) STRANOEONESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEO ID N0:218: 

TACGTACTGG TCTTGAAGGT CTTAGGTAGA GAAAAAATGT GAATATTTAA TCAAAGACTA 60 

TGTATGAAAT GGGACTGTAA GTACAGAGGG AAGGGTGGCC CTTATC6CCA GAAGTTGGTA 120 

GATGCGTCCC CGTCATGAAA TGTTGTGTCA CTGCCCGACA TTT6CCGAAT TACTGAAATT 180 

CCGTAGAATT AGTGCAAATT CTAACGTTGT TCATCTAAGA TTATGGTTCC ATGHTCTAG ?40 

TACTTTTA 248 
(2) INFORMATION FOR SEO ID N0:219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 530 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ 10 N0:219: 

TGACGCTTGG CCACTTGACA CAAGTAGGGG ATAAGGACAA AGACCCATNA GGTGGCCTGT 60 

CAGCCTTTTG TTACTGTTGC TTCCCTGTCA CCACG6CCCC CTCTGTAGGG GTGTGCTGTG 120 

CTCTGTGGAC ATTGGTGCAT TTTCACACAT ACCAHCTCT TTCTGCTTCA CAGCAGTCCT 180 

GAGGCGGGA6 CACACAGGAC TACCTTGTCA GATGANGATA ATGATGTCTG GCCAACTCAC 240 

CCCCCAACCT TCTCACTAGT TATANGAAGA GCCANGCCTA NAACCTTCTA TCCTGNCCCC 300 
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TTGCCCTATG ACCTCATCCC T6TTCCA1GC CCTATTCTGA TTTCTGGTGA ACTTTGGAGC 360 

AGCCTGGTTT NTCCTCCTCA CTCCAGCCTC TCTCCATACC ATGGTANGGG GGTGCTGTTC 420 

CACNCAAANG GTCAG6TGT6 TCTGGGGAAT CCTNANIANCT 6CCNGGAGTT TCCNANGCAT 480 

TCTTAAAAAC CTTCTTGCCT AATCANATNG TGTCCAGTGG CCAACCNTCN 530 
(2) INFORMATION FOR SEQ ID NO; 220: 

(1) SEQULNCE CHARACTERISTICS: 

(A) LENGTH: 531 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO:220: 

T6ACGCTTGG CCACTTGACA CTAAATAGCA TCTTCTAAAG GCCTGATTCA GAGTTGTGGA 60 

AAATTCTCCC AGTGTCAGGG ATTGTCA6GA ACAGGGCTGC TCCTGTGCTC ACTTTACCTG 120 

CTGTGTTTCT GCTGGAAAA6 GAGGGAAGAG GAATGGCTGA TTTTTACCTA ATGTCTCCCA 180 

GTnnCATA TTCTTCTTGG ATCCTCTTCT CTGACAACT6 TTCCCTTTTG GTCTTCTTCT 240 

TCTTGCTCAG A6AGCAGGTC TCTTTAAAAC T6AGAAGGGA 6AATGAGCAA ATGAITAAAG 300 

AAAACACACT TCTGAGGCCC AGAGATCAAA TATTAGGTAA ATACTAAACC GCTTGCCTGC 360 

TGTGGTCACT TTTCTCCTCT TTCACATGCT CTATCCCTCT ATCCCCCACC TATTCATATG 420 

GCHTTATCT GCCAAGTTAT CCGGCCTCTC ATCAACCTTC TCCCCTAGCC TACTG6GGGA 480 
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TATCCATCTG GGTCTGTCTC TGGTGTATTG GTGTCAAGTG GCCAAGCGTC A 
(2) INFORMATION FOR SFQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 530 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 

ATTGACGCTT GGCCACTT6A CACCCGCCTG CCTGCAATAC TGGG6CAAGG GCCHCACTG 60 

CTTTCCT6CC ACCAGCTGCC ACTGCACACA GAGATCAGAA ATGCTACCAA CCAAGACTGT 120 

TGGTCCTCA6 CCTCTCTGAG 6AGAAAGAGC AGAAGCCTGG AAGTCAGAAG AGAAGCTAGA 180 

TCG6CTAC6G CCTTGGCAGC CAGCTTCCCC ACCTGTGGCA ATAAAGTCGT GCATGCCHA 240 

ACAATGGGGG CACCTCCTGA GAAACACATT GTTAGGCAAT TCG6CGTGTG TTCATCAGAG 300 

CATATTTACA CAAACCTCGA lAGTGCAGCC TACTATCCAC TAHGCTCCT ACGCTGCAAA 360 

CCTGAACAGC ATGGGACT6T ACTGAATACT GGAAGCAGCT GGTGATGGTA CTTATTTGTG 420 

TATCTAAACA CAGAGAAGGT ACAGTAAGAA TAT6GTATCA TAAACTTACA GGGACCGCCA 480 

TCCTATATGC AGTCTGHGT GACCAAAATG T6TCAAGTGG CCAAGCGTCA 530 
(2) INFORMATION FOR SEQ ID N0:222; 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 578 base pairs 

(B) TYPE: nucleic acid 
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CO STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEO ID N0:222: 

TGTATCGACG TAGTGGTCTC CGGGCTACTA GGCCGTTGTG TGCTGGTAGT ACCTGGHCA 60 

CTGAAAGGCG CATCTCCCTC CCCGCGTCGC CCTGAAGCAG 6GGGAGGACT TCGCCCAGCC 120 

AAGGCAGTTG TAT6A6TTTT AGCT6CGGCA CTTCGAGACC TCTGAGCCCA CCTCCTTCAG 180 

GA6CCTTCCC C6ATTAAGGA AGCCAGGGTA AGGATTCCTT CCTCCCCCAG ACACCACGAA 240 

CAAACCACCA CCCCCCCTAT TCTG6CAGCC CATATACATC AGAACGAAAC AAAAATAACA 300 

AATAAACNAA AACCAAAAAA AAAAGA6AAG GGGAAATGTA TATGTCTGTC CATCCTGHG 360 

CTTTAGCCTG TCAGCTCCTA NAGGGCA6GG ACC6TGTCTT CCGAATGGTC TGTGCAGCGC 420 

CGACTGCGGG AAGTATCGGA G6AGGAAGCA 6AGTCAGCAG AAGTTGAACG GTGG6CCCGG 480 

CGGCTCTTGG 6GGCTGGTGT TGTACnCGA 6ACCGCTTTC GCHTTIGTC TTAGATTTAC 540 

GTTT6CTCTT TGGAGTGGGA NACCACTACN TCNATACA 578 

(2) INFORMATION FOR SEQ ID N0:223: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 578 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID N0;223: 
TGTATCGACG TAGTGGTCTC CTCTTGCAAA GGACTGGCTG GTGAATGGTT TCCCTGAATT 60 
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ATGGACTTAC CCTAAACATA TCTTATCATC AnACCAGTT GCAAAATATT AGAATGTGTT 120 
GTCACTGTTT CATTTGATTC CTAGAAGGTT A6TCTTAGAT ATGTTACTTT AACCTGTATG 180 

CTGTAGTGCT TTGAATGCAT TTTTTGTTTG CATTTTTGTT TGCCCAACCT GTCAATTATA 240 

GCTGCTTAGG TCTGGACTGT CCTGGATAAA GCTGTTAAAA TATTCACCAG TCCAGCCATC 300 

TTACAAGCTA ATTAAGTCAA CTAAATGCTT CCTTGTTTTG CCAGACTTGT TATGTCAATC 360 

CTCAATTTCT GGGTTCATTT TGGGTGCCCl AAATCTTAGG GTGTGACTTT CTTAGCATCC 420 

TGTAACATCC ATTCCCAAGC AAGCACAACT TCACATAATA CHTCCAGAA GTTCATTGCT 480 

GAAGCCTTTC CTTCACCCAG CGGAGCAACT TGATTTTCTA CAACTTCCCT CATCAGAGCC 540 

ACAAGAGTAT GGGATATGGA GACCACTACG TCGATACA 578 
(2) IMFORMATION FOR SEQ ID NO: 224: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID N0:224: 

TGTATCGAC6 TANTGGTCTC CCAAGGTGCT GGGATTGCAG GCATGAGCCA CCACTCCCAG 60 

GTGGATCTTT nCTTTATAC HACTTCATT AGGTHCTGT TAHCAAGAA GTGTAGTGGT 120 

AAAAGTCTTT TCAATCTACA TGGTTAAATA ATGATAGCCT GG6AAATAAA TAGAAATHT 180 

TTCTTTCATC TTTAGGTTGA ATAAAGAAAC AGAAAAAATA GAACATACTG AAAATAATCT 240 
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AAGTTCCAAC CATAGAAGAA CTGCAGAA6A AATGAAGAAA GTGATGATGA TTTAGATTTl 300 
GATATTGATT TAGAA6ACAC AGGAGGAGAC CACTAC6TCG ATACA 34b 
(2) INFORMATION FOR SEO ID N0:225; 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 347 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO:225; 

TGTATCGACG TAGTGGTCTC CAAACTGAGG TATGTGTGCC ACTA6CACAC AAAGCCTTCC 60 

AACAGGGACG CAGGCACAGG CAGTTTAAAG GGAATCTGTT TCTAAATTAA TTTCCACCTT 120 

CTCTAAGTAT TCTTTCCTAA AACTGATCAA GGT6T6AAGC CTGTGCTCTT TCCCAACTCC 180 

CCTTTGACAA CAGCCTTCAA CTAACACAAG AAAAGGCATG TCTGACACTC TTCCTGAGTC 240 

TGACTCTGAT ACGTTGTTCT GATGTCTAAA GAGCTCCAGA ACACCAAAGG 6ACAATTCAG 300 

AAT6CTGGTG TATAACA6AC TCCAATGGAG ACCACTACGl C6ATACA 347 
(2) INFORMATION FOR SEQ ID NO: 226: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEO ID N0:226: 
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AGGNGNGGGA NTGTATCGAC GTAGTGGTCT CCCAACAGTC TGTCATTCAG TCT6CAGGTG 60 

TCAGTGTTTT GGACAATGAG GCACCATTGT CACT.TATTGA CTCCTCAGCT CTAAATGCTG 1?0 

AAATTAAATC TTGTCATGAC AAGTCTGGAA TTCCTGATGA GGTTTTACAA AGTATTTTGG 180 

ATCAATACTC CAACAAATCA GAAAGCCAGA AAGAGGATCC TTTCAATAH GCAGAACCAC 240 

GAGTGGATTT ACACACCTCA GGAGACCACT ACGTCGATAC A 281 
(2) INFORMATION FOR SEQ ID N0:227: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3646 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(x1) SEQUENCE DESCRIPTION: SEQ ID N0:227: 

GGGAAACACT TCCTCCCAGC CTTGTAAGGG TTGGA6CCCT CTCCAGTATA TGCTGCAGAA 60 

TTTTTCTCTC GGTTTCTCAG AGGATTATGG AGTCCGCCTT AAAAAA6GCA AGCTCTGGAC 120 

ACTCTGCAAA GTAGAATG6C CAAAGTTTGG AGTTGA6TGG CCCCTTGAAG GGTCACT6AA ISO 

CCTCACAATT GTTCAAGCTG TGTGGCGGGT TGTTACT6AA ACTCCCGGCC TCCCTGATCA 240 

GTFTCCCTAC ATT6ATCAAT GGCTGAGTH GGTCAGGAGC ACCCCTTCCG T6GCTCCACT 300 

CATGCACCAT TCATAATHT ACCTCCAAGG TCCTCCTGAG CCAGACCGTG TTTTCGCCTC 360 

GACCCTCAGC CGGTTCGGCT CGCCCTGTAC TGCCTCTCTC TGAAGAAGAG GAGAGTCTCC 420 

CTCACCCAGT CCCACCGCCT TAAAACCAGC CTACTCCCTT AGG6TCATCC CATGTCTCCT 480 
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CGGCTATGTC CCCTGTAGGC TCATCACCCA TTGCCTCTTG GTTGCAACCG TGGTGGGAGG 540 

AAGTA6CCCC TCTACTACCA CTGAGAGAGG CACAAGTCCC TCTGGGTGAT GAGTGCTCCA 600 

4 CCCCCTTCCT GGTTTATGTC CCTTCTTTCT ACTTCTGACT TGTATAATTG GAAAACCCAT 660 

I 

AATCCTCCCT TCTCTGAAAA GCCCCAGGCT TTGACCTCAC TGATGGAGTC TGTACTCTG6 720 

ACACATT6GC CCACCTGGGA TGACTGTCAA CAGCTCCTTT TGACCCTTTT CACCTCTGAA 780 

GAGAGGGAAA 6TATCCAAAG AGAGGCCAAA AAGTACAACC TCACATCAAC CAATAGGCCG 840 

GAGGAGGAAG CTAGAGGAAT AGTGATTAGA GACCCAATTG GGACCTAATT GGGACCCAAA 900 

I TTTCTCAA6T GGAGGGAGAA CTTTTGACGA TTTCCACCGG TATCTCCTCG TGGGTATTCA 960 

GGGAGCTGCT CAGAAACCTA TAAACTTGTC TAAGGCGACT GAAGTCGTCC AGGGGCATGA 1020 

TGAGTCACCA G6AGTGTTTT TAGAGCACCT CCAGGAGGCT TATCAGATTT ACACCCCTTT 1080 

TGACCTGGCA 6CCCCCGAAA ATAGCCATGC TCTTAATTTG GCATTTGTGG CTCAGGCAGC 1140 

CCCAGATAGT AAAAGGAAAC TCCAAAAACT A6AGGGATTT TGCTGGAATG AATACCAGTC 1200 

AGCHTTAGA 6ATAGCCTAA AA6GTTTTTG ACAGTCAAGA GGTTGAAAAA CAAAAACAA6 1260 

.^^ CAGCTCAGGC AGCTGAAAAA AGCCACTGAT AAAGCATCCT GGAGTATCAG AGTTTACTGT 1320 

TAGATCA6CC TCATTTGACT TCCCCTCCCA CAIGGIGHT AAATCCAGC1 ACACTACTTC 1380 

% 

CT6ACTCAAA CTCCACTATT CCTGTTCATG ACTGTCAGGA ACT6TTG6AA ACTACTGAAA 1440 

CTGGCCGACC TGATCTTCAA AATGTGCCCC TAGGAAAGGT GGATGCCACC ATGTTCACAG 1500 

ACAGTAGCAG CTTCCTCGAG AAGGGACTAC GAAAGGCCGG TGCAGCTGTT ACCATGGAGA 1560 



(SDOCID: <WO__872542eA3JB> 



wo 97/25426 



189 



PCT/US97/00485 



CAGATGiGTT GTGGGCTCAG GCTTTACCAG CAAACACCTC AGCACAAAAG GCTGAATTGA 1620 
TCGCCCTCAC TCAGGCTCTC CGATGGGGTA AGGATATTAA CGnAACACT GACAGCAGG7 1680 
ACGCCTTTGC TACTGTGCAT GTACGTGGAG CCATCTACCA GGA6CGT6GG CTACTCACCT 1740 

CAGCAGGTGG CTGTAATCCA CT6TAAAGGA CATCAAAAGG AAAACACGGC TGTTGCCCGT 1800 

GGTAACCAGA AAGCTGATTC AGCA6CTCAA GATGCAGTGT GACTTTCAGI CACGCCTCTA I860 

AACTTGCTGC CCACAGTCTC CTTTCCACAG CCAGATCTGC CTGACAATCC CGCATACTCA 1920 

ACAGAAGAAG AAAACTGGCC TCAGAACTCA GAGCCAATAA AAATCAGGAA GGTTGGTGGA 1980 

TTCnCCTGA CTCTAGAATC TTCATACCCC GAACTCHGG GAAAACHTA ATCAGTCACC 2040 

TACAGTCTAC CACCCATTTA GGAGGAGCAA AGCTACCTCA 6CTCCTCCG6 AGCCGTTTTA 2100 

AGATCCCCCA TCTTCAAAGC CTAACAGATC AAGCAGCTCT CCGGTGCACA ACCTGCGCCC 2160 

AGGTAAATGC CAAAAAAGGT CCTAAACCCA GCCCAGGCCA CCGTCTCCAA GAAAACTCAC 2220 

CAGGAGAAAA GTGGGAAATT GACTTTACAG AAGTAAAACC ACACCGGGCT GGGTACAAAl 2280 

ACCTTCTAGT ACTGGTAGAC ACCTTCTCTG GATGGACT6A AGCATTTGCT ACCAAAAAC6 2340 

AAACTGTCAA TATGGTAGTT AAGTTnTAC TCAATGAAAT CATCCCTCGA CATGGGCTGC 2400 

CTGTTTGCCA TAGGGTCTGA TAATGGACCG GCCTTCGCCT TGTCTATAGT HAGTCAGTC 2460 

AGTAAGGCGT TAAACATTCA ATGGAAGCTC CATTGTGCCT ATCGACCCCA GAGCTCTG6G 2520 

CAAGTAGAAC GCATGAACTG CACCCTAAAA AACACTCTTA CAAAATTAAT CTTAGAAACC 2580 

G6TGTAAATT GTGTAAGTCT CCTTCCTTTA GCCCTACTTA GAGTAAGGTG CACCCCTTAC 2640 
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TGGGCTGGGT TCTTACCTTT TGAAATCATG 

^ CTAAGAGATG CCCAATTG6C AAAAATATCA 

'•^ 

I CCCCAACAGG TACAA6ATAT CATCCTGCCA 

CCTGAACAGA CAG66CCCTG CCATTCATTC 
TTCCA6AGAG AAGGACTCCC TCCTGCTTGG 
ACGGCTCTGA AGGTGGATGG CATTCCT6CG 
AACAGAGCCC AACTAGAAAC ATGGGTCCCC 

I CTAAGnGGG T6AAGCCATT AGAHAAHC 

CTTCTGTCAA ACTTATGTAT CTTAAGACTC 
CAATGAHTG AHCCCCCAA AAACACAAGT 
TAACCCTGTT TTTAGACTCT CCCTTTCCTT 
ACTCTCCCTT AGCTAAGAGC GCCAGATGGA 
TTCCTCAAGG ACTTAACTTG TGCAAGCT6A 

4 6ATAAGATAC TGTGGCAAGC TATATCCGCA 

CCCCTCTACC CTTCA6CAAC CACCACCCTG 
GGCCCTCCAC CAGCAAAAAG ATTCTGACTC 
TAGCAGTAAA GTTTTTTTn CniTTCTTT 
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TATGGGAGGG TGCTGCCTAT CTTGCCTAAG 2700 

CAAACTAATT TATTACAGTA CCTACAGTCT 2760 

CTTGTTCGAG GAACCCATCC CAATCCAATT 2820 

CC6CCAGGTG ACCTGTTGTT TGTTAAAAAG 2880 

AAGAGACCTC ACACCGTCAT CACGATGCCA 2940 

TGGAnCATC ACTCCCGCAT CAAAAAGGCC 3000 

AGGGCTGGGT CAGGCCCCTT AAAACTGCAC 3060 

nmCTTAA TTTTGTAAAA CAATGCATAG 3120 

AATATAACCC CCTTGHATA ACTGAGGAAT 3180 

GGGGAATGTA GTGTCCAACC IGGTHTTAC 3240 

TAATCACTCA GCTTGTTTCC ACCTGAATTG 3300 

CTCCATCTTG GCTCTHCAC TGGCAGCCGC 3360 

CTCCCAGCAC ATCCAAGAAT 6CAATTAACT 3420 

6TTCCCAGGA ATTCGTCCAA TTGATCACA6 3480 

ATCAGTCAGC AGCCATCAGC ACCGAG6CAA 3540 

ACTGAAGACT TGGATGATCA TTAGTATTTT 3600 

CTTniTTCT CGTGCC 3646 
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Claims 

1 . An isolated DNA molecule, comprising: 

(a) a nucleotide sequence selected from the group consisting of SEQ ID 
N0:1, SEQ ID N0:3 - SEQ ID NO:77 and SEQ ID NOS:142, 143, 146-152, 154-166, 168- 
176, 178-192, 194-198, 200-204, 206, 207, 209-214, 216, 218, 219, 221-227; 

(b) a variant of said nucleotide sequence that contains one or more 
nucleotide substitutions, deletions, insertions and/or modifications at no more than 20% of 
the nucleotide positions, such that the antigenic and/or immunogenic properties of the 
polypeptide encoded by the nucleotide sequence are retained; or 

(c) a nucleotide sequence encoding an epitope of a polypeptide encoded by 
at least one sequence selected from the group consisting of SEQ ID N0:1, SEQ ID N0:3 - 
SEQ ID NO:77 and SEQ ID NOS:142, 143, 146-152, 154-166, 168-176, 178-192, 194-198, 
200-204, 206, 207, 209-214, 216, 218, 219, 221-227. 

2. An isolated DNA molecule encoding an epitope of a polypeptide, 
wherein said polypeptide is encoded by a nucleotide sequence that: 

(a) hybridizes to a sequence selected from the group consisting of SEQ ID 
N0:1, SEQ ID N0:3 - SEQ ID NO:77 and SEQ ID NOS:142, 143, 146-152, 154-166. 168- 
176, 178-192, 194-198, 200-204, 206, 207, 209-214, 216, 218, 219, 221-227 under stringent 
conditions; and 

(b) is at least 80% identical to a sequence selected from the group 
consisting of SEQ ID N0:1, SEQ ID N0:3 - SEQ ID NO:77 and SEQ ID NOS:142, 143, 
146-152, 154-166, 168-176, 178-192, 194-198, 200-204, 206, 207, 209-214, 216, 218, 219, 
221-227; 

and wherein RNA corresponding to said nucleotide sequence is expressed at a greater level in 
human breast tumor tissue than in normal breast tissue. 

3. An isolated DNA molecule encoding an epitope of a polypeptide, 
wherein said polypeptide is encoded by: 
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(a) a nucleotide sequence transcribed from the sequence of SEQ ID NO: 

141; or 

(b) a variant of said nucleotide sequence that contains one or more 
nucleotide substitutions, deletions, insertions and/or modifications at no more than 20% of 
the nucleotide positions, such that the antigenic and/or immunogenic properties of the 
polypeptide encoded by the nucleotide sequence are retained. 

4. An isolated DNA or RNA molecule comprising a nucleotide sequence 
complementary to a DNA molecule according to any one of claims 1-3. 

5. A recombinant expression vector comprising a DNA molecule 
according to any one of claims 1-3. 

6. A host cell transformed or transfected with an expression vector 
according to claim 5. 

7. A polypeptide comprising an amino acid sequence encoded by a DNA 
molecule according to any one of claims 1-3. 

8. A polypeptide according to claim 7 wherein said polypeptide 
comprises an epitope of an amino acid sequence encoded by at least one nucleotide sequence 
selected from the group consisting of SEQ ID NO:l, SEQ ID N0:3 - SEQ ID NO:77 and 
SEQ ID NOS:142, 143, 146-152, 154-166, 168-176, 178-192, 194-198, 200-204, 206, 207, 
209-214,216,218,219, 221-227. 

9. A monoclonal antibody that binds to a polypeptide according to 

claim 7. 
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10. A method for determining the presence of breast cancer in a patient 
comprising detecting, within a biological sample, at least one polypeptide according to claim 
7, and therefrom determining the presence of breast cancer in the patient. 

11. A method for determining the presence of breast cancer in a patient 
comprising detecting within a biological sample, at least one polypeptide encoded by a 
nucleotide sequence selected from the group consisting of SEQ ID NO: 78-86 and SEQ ID 
NOS:144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, and sequences that 
hybridize thereto under stringent conditions. 



12. The method of claims 10 or 11 wherein the biological sample is a 
portion of a breast tumor. 

^ 13, The method of claim 10 wherein the step of detecting comprises 

contacting the biological sample with a monoclonal antibody according to claim 9. 

14. The method of claim 11 wherein the step of detecting comprises 
contacting the biological sample with a monoclonal antibody that binds to a polypeptide 
encoded by a nucleotide sequence selected from the group consisting of SEQ ID NO: 78-86 
and SEQ ID NOS:144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, and sequences 
that hybridize thereto under stringent conditions. 



15. A method for determining the presence of breast cancer in a patient 
comprising detecting, within a biological sample, an RNA molecule encoding at least one 
polypeptide according to claim 7, and therefrom determining the presence of breast cancer in 
:1 the patient. 



16. A method for determining the presence of breast cancer in a patient 
comprising detecting, within a biological sample, at least one RNA molecule encoding a 
polypeptide encoded by a nucleotide sequence selected from the group consisting of SEQ ID 
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NO: 78-86 and SEQ ID NOS:144, 145, 153, 167, 177, 193, 199,205, 208,215,217,220, and 
sequences that hybridize thereto under stringent conditions; and therefrom determining the 
presence of breast cancer in the patient. 

17. The method of claims 15 or 16 wherein the biological sample is a 
portion of a breast tumor. 

1 8. The method of claim 15 wherein the step of detecting comprises: 

(a) preparing cDNA from RNA molecules within the biological sample; 

and 

(b) specifically amplifying cDNA molecules that are capable of encoding 
at least a portion of a polypeptide according to claim 7, and therefrom determining the 
presence of breast cancer in the patient. 

19. The method of claim 16 wherein the step of detecting comprises: 

(a) preparing cDNA from RNA molecules within the biological sample; 

and 

(b) specifically amplifying cDNA molecules that are capable of encoding 
at least a portion of a polypeptide encoded by a nucleotide sequence selected from the group 
consisting of SEQ ID NO: 78-86 and SEQ ID NOS:144, 145, 153, 167, 177, 193, 199, 205, 
208, 215, 217, 220, and sequences that hybridize thereto under stringent conditions; and 
therefrom determining the presence of breast cancer in the patient. 

20. A polypeptide according to claim 7 for use within a method for 
detecting the presence of breast cancer in a patient. 

21. A polypeptide encoded by a nucleotide sequence selected from the 
group consisting of SEQ ID NO: 78-86 and SEQ ID NOS:l44, 145, 153, 167, 177, 193, 199, 
205, 208, 215, 217, 220, and sequences that hybridize thereto under stringent conditions, for 
use within a method for detecting the presence of breast cancer in a patient. 
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22. A method for monitoring the progression of breast cancer in a 
patient, comprising: 

^ (a) detecting an amount, in a biological sample, of at least one polypeptide 

according to claim 7 at a first point in time; 
I (b) repeating step (a) at a subsequent point in time; and 

(c) comparing the amounts of polypeptide detected in steps (a) and (b), 
therefrom monitoring the progression of breast cancer in the patient. 

23. A method for monitoring the progression of breast cancer in a 
patient, comprising: 

(a) detecting in a biological sample an amount of at least one polypeptide 
at a first point in time, the polypeptide being encoded by a nucleotide sequence selected from 

i ^he group consisting of SEQ ID NO: 78-86 and SEQ ID NOS:144, 145, 153, 167, 177, 193, 

199, 205, 208, 215, 217, 220, and sequences that hybridize thereto under stringent conditions; 

(b) repeating step (a) at a subsequent point in time; and 

(c) comparing the amounts of polypeptide detected in steps (a) and (b), 
and therefrom monitoring the progression of breast cancer in the patient. 

24. The method of claims 22 or 23 wherein the biological sample is a 
portion of a breast tumor. 

25. The method of claim 22 wherein the step of detecting comprises 
contacting a portion of the biological sample with a monoclonal antibody according to 
claim 9. 

i 

26. The method of claim 23 wherein the step of detecting comprises 
contacting the biological sample with a monoclonal antibody that binds to a polypeptide 
encoded by a nucleotide sequence selected from the group consisting of SEQ ID NO: 78-86 
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and SEQ ID NOS:144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, and sequences 
that hybridize thereto under stringent conditions. 

27. The method of either of claims 20 or 22 wherein said polypeptide 
comprises an epitope of an amino acid sequence encoded by at least one nucleotide sequence 

I selected from the group consisting of SEQ ID N0:1, SEQ ID N0:3 - SEQ ID NO:77 and 

SEQ ID NOS:142, 143, 146-152, 154-166, 168-176, 178-192, 194-198, 200-204, 206, 207, 
209-214, 216, 218, 219, 221-227. 

28. A method for monitoring the progression of breast cancer in a 
patient, comprising; 

(a) detecting an amount, within a biological sample, of at least one RNA 
molecule encoding a polypeptide according to claim 7 at a first point in time; 
>; (b) repeating step (a) at a subsequent point in time; and 

(c) comparing the amounts of RNA molecules detected in steps (a) and 
(b), and therefrom monitoring the progression of breast cancer in the patient. 

29. The method of claim 28 wherein the step of detecting comprises: 

(a) preparing cDNA from RNA molecules within the biological sample; 

and 

(b) specifically amplifying cDNA molecules that are capable of encoding 
at least a portion of a polypeptide according to claim 7. 

^ 30, A method for monitoring the progression of breast cancer in a 

patient, comprising: 

I (a) detecting an amount, within a biological sample, of at least one RNA 

molecule at a first point in time, the RNA molecule encoding a polypeptide encoded by a 
nucleotide sequence selected from the group consisting of SEQ ID NO: 78-86 and SEQ ID 
NOS:144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, and sequences that 
hybridize thereto under stringent conditions; 
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(b) repeating step (a) at a subsequent point in time; and 

(c) comparing the amounts of RNA molecules detected in steps (a) and 
(b), and therefrom monitoring the progression of breast cancer in the patient. 

31 . A pharmaceutical composition, comprising a polypeptide according to 
claim 7 and a physiologically acceptable carrier. 

32. A pharmaceutical composition for inhibiting the development of breast 
cancer, comprising a polypeptide and a physiologically acceptable carrier, the polypeptide 
being encoded by a nucleotide sequence selected from the group consisting of SEQ ID NO: 
78-86 and SEQ ID NOS:144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, and 
sequences that hybridize thereto under stringent conditions. 

33. A vaccine, comprising a polypeptide according to claim 7 and an 
immune response enhancer. 

34. A vaccine, comprising a DNA molecule according to any one of 

claims 1-3. 

35. A vaccine, comprising a recombinant expression vector comprising a 
DNA molecule according to any one of claims 1-3. 

36. A vaccine for inhibiting the development of breast cancer, comprising 
a polypeptide and an inrunune response enhancer, the polypeptide being encoded by a 
nucleotide sequence selected from the group consisting of SEQ ID NO: 78-86 and SEQ ID 
NOS:144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, and sequences that 
hybridize thereto under stringent conditions. 

37. A diagnostic kit comprising: 

(a) one or more monoclonal antibodies according to claim 9; and 
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(b) a detection reagent. 

38. A diagnostic kit comprising: 

(a) one or more monoclonal antibodies that bind to a polypeptide encoded 
by a nucleotide sequence selected from the group consisting of sequences provided in SEQ 
ID 78-86 and SEQ IDNOS:144, 145, 153, 167, 177, 193, 199,205, 208,215,217, 220; and 

(b) a detection reagent. 

39. The kit of any one of claims 37 or 38 wherein the monoclonal 
antibody(s) are immobilized on a solid support. 

40. A diagnostic kit comprising a first polymerase chain reaction primer 
and a second polymerase chain reaction primer, the first and second primers each comprising 
at least about 10 contiguous nucleotides of an RNA molecule according to claim 4. 

41. A diagnostic kit comprising a first polymerase chain reaction primer 
and a second polymerase chain reaction primer, the first and second primers each comprising 
at least about 10 contiguous nucleotides of an RNA molecule encoding a polypeptide 
encoded by a nucleotide sequence selected from the group consisting of SEQ ID NOS:78-86 
and SEQ ID NOS:144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220. 

42. A diagnostic kit comprising at least one oligonucleotide probe, the 
oligonucleotide probe containing at least about 15 contiguous nucleotides of a DNA molecule 
according to claim 4. 

43. A diagnostic kit comprising at least one oligonucleotide probe, the 
oligonucleotide probe comprising at least about 15 contiguous nucleotides of a DNA 
sequence selected from the group consisting of SEQ ID NOS:78-86 and SEQ ID NOS:144, 
145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220. 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA BISAgl 



TTA GAG ACC CAA TTG GGA CCT AAT TGG GAG CCA AAT TTC TCA AGT GGA 48 
Leu Glu Thr Gin Leu Gly Pro Asn Trp Asp Pro Asn Phe Ser Ser Gly 
15 10 15 



GGG AGA ACT TTT GAC GAT TTC CAC CGG TAT CTC CTC GTG GGT ATT CAG 96 
Gly Arg Thr Phe Asp Asp Phe His Arg Tyr Leu Leu Vol Gly He Gin 
20 25 30 



GGA GCT GCC CAG AAA CCT ATA AAC TTG TCT AAG GCG ATT GAA GTC GTC 144 
Gly Ala Ala Gin Lys Pro He Asn Leu Ser Lys Alo He Glu Val Val 
35 40 45 



CAG GGG CAT GAT GAG TCA CCA GGA GTG TTT TTA GAG CAC CTC CAG GAG 192 

Vo Gly Vol Phe Leu Glu 
55 ^0 



Gin G]j His Asp Glu Ser Pro Gly Vol Phe Leu Glu His Leu Gin Glu 



GCT TAT CGG ATT TAC ACC CCT TTT GAC CTG GCA GCC CCC GAA AAT AGC 240 

Alo Tyr Arg He Tyr Thr Pro Phe Asp Leu Alo Alo Pro Glu Asn Ser 
65 70 75 80 

CAT GCT CTT AAT TTG GCA TTT GTG GCT CAG GCA GCC CCA GAT AGT AAA 288 

His Alo Leu Asn Leu Alo Phe Vol Alo Gin Alo Alo Pro Asp Ser Lys 
85 90 95 

AGG AAA CTC CAA AAA CTA GAG GGA TTT TGC TGG AAT GAA TAC CAG TCA 336 

Arg Lys Leu Gin Lys Leu Glu Gly Phe Cys Trp Asn Glu Tyr Gin Ser 

100 105 no 

GCT TTT AGA GAT AGC CTA AAA GGT TTT 363 

Ale Phe Arg Asp Ser Leu Lys Gly Phe 

115 120 



Fig. 6 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cONA B17Agl 

6GGCACAGTG GCTCATACCT GTAATCCT6A CCGTTTCAGA GGCTCAGGTG GGGGGATC6C 60 
TT6AGCCCAA GATTTCAAGA CTAGTCTCGG TAACATAGTG AGACCCTATC TCTACGAAAA 120 
AATAAAAAM TGAGCCTGGT GTAGTGGCAC ACACCAGCTG AGGAGGGAGA ATCG 174 

FIG, 7 



iNSOOClD: <WO_9725426A3JB> 



wo 97/25426 



8/21 



PCTAJS97/00485 



NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B17Ag2 



TGGGGGCTCT GACTAGAAAT TCAAGGAACC 
TACACTGTGG NCTCCAATAA ACTGCTTCTT 
GAAAACGATG TCTGTGTATA GCCAAGTCAG 
AAATATCAGA ATGTAAAACC TGGGAACCAG 
GAAGACTGAA CAGTACTACT GTGAAAAGCC 
AAGGATGGCT GGGAGAATGA ATGCTCTGTC 
CTTTAT 



TGGGATTCAA GTCCAACTGT GACACCAACT 60 

TCCTATTCCC TCTCTATTAA ATAAAATAAG IPO 

NTATCCTAAA AGGAGATACT AAGTGACATT 180 

GTTCCCAGCC TGGGATTAAA CTGACAGCAA 240 

CGAAGNGGCA ATATGTTCAC TCTACCGTTG 300 

CCCCAGTCCC AAGCTCACTT ACTATACCTC 360 

366 



Fig. 8 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST- TUMOR SPECIFIC cDNA B13Ag2a 



TATAATCATG TTTCTCATTA TTTTCACATT TTATTACCAA TTTCTGTTTA CCCTGAAAAA 60 

TATGAGGGAA ATATATGAAA CAGGGAGGCA ATGTTCAGAT AATTGATCAC AAGATATGAT 120 

TTCTACATCA GATGCTGTTT CCTTTCCTGT TTATTTCCTT TTTATTTCGG TTGTGGGGTT 180 

GAATGTAATA GCTTTGTTTC AAGAGAGAGT TTTGGCAGTT TCTGTAGCTT CTGACACTGC ?40 

TCATGTCTCC AGGCATCTAT TTGCACTTTA GGAGGTGTCG TGGGAGACTG AGAGGTCTAT 300 

TTTTTCCATA TTTG 3j4 



Fig. 9 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA BUAgIb 



ATACAGTCGG TTTCCATTTA TTTAACCCCC 
GGTGTTTTTT ACTGTAAACA ATAAGGAGAC 
TCATATTTTA CGCTCGAGGG TTTTTACCGG 
AACTCGTTTG GAACAAGATA TTTTTTCTTT 
GTGTCTGGGG GACTGCTGGT CACTGTTTCT 
AGAAAAAAAA ATTTTTTTGT TTTATTTGAA 
CTGCTGTATA TAGTTTTAAA TGGTTTATTG 



ACCTGAACGG CATAAACTGA GTGTTCAGCT 60 

TTTGCTCTTC ATTTAAACCA AAATCATATT 120 

TTCCTTTTTA CACTCCTTAA AACAGTTTTT 180 

CCTGGCAGCT TTTAACATTA TAGCAAATTT E40 

CACAGTTGCA AATCAAGGCA TTTGCAACCA 300 

ACTGGACCGG ATAAACGGTG TTTGGAGCGG 360 

CACCTCCTTA AGTTGCACTT ATGT 414 



Fig. 10 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B13Ag1a 



TATATATTTA ATAACTTAAA TATATTTTGA TCACCCACTG GGGTGATAAG ACAATAGATA 60 

TAAAAGTATT TCCAAAAAGC ATAAAACCAA AGTATCATAC CAAACCAAAT TCATACTGCT lEO 

TCCCCCACCC GCACTGAAAC TTCACCTTCT AACTGTCTAC CTAACCAAAT TCTACCCTTC 180 

AAGTCTTTGG TGCGTGCTCA CTACTCTTTT TTTTTTTTTT TTTNTTTTGG AGATGGAGTC 240 

TGGCTGTGCA GCCCAGGGGT GGAGTACAAT GGCACAACCT CAGCTCACTG NAACCTCCGC 300 

CTCCCAGGTT CATGAGATTC TCCTGNTTCA GCCTTCCCAG TAGCTGGGAC TACAGGTGTG 360 

CATCACCATG CCTGGNTAAT CTTTTTTNGT TTTNOGGTAG AGATGGGGGT TTTACATGTT 420 

GGCCAGGNTG GTNTCGAACT CCTGACCTCA AGTGATCCAC CCACCTCAGG CTCCCAAAGT 480 

GCTAGGATTA CAGACATGAG CC 502 



Fig. 11 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA BIIAgI 



ACATGCAGAA TATTCTATCG GTACTTCAGC 
CTATCCTCAA GATGAGTATT TAGAAAGAAT 
CTCTGACTAC ACGAAATTGT TCAGATGTGA 
ATTATTAAGT GATTATTTTA AAGGGAATCC 
AGATGATATA GAAATAGAAC AGAAAGAGAC 
TGAAGAGCCT ATAGTAGAAA ATGAATTAGC 
TCCTGATGAA TCTTATATTC AGCCATCGAC 
ATAATAGAAA CTGGGTGCGG GGCTATTGAT 
AAAATATAAC TCGATTGCAT 



TATTACTCAT TTTGATGGCG CAATCCGAGC 60 

TGATTTAGCG ATAGACCAAG CTGGTAAGCA 120 

TGGATTTATG ACAGTTGATC TTTGGAAGAG 180 

ATTAATTCCA GAATATCTTG GTTTAGCTCA ?40 

TACAAATGAA GATGTATCAC CAACTGATAT 300 

TGCATTTATT AGCCTTACAC ATAGCGATTT 360 

ATAGCATTAC CTGATGGGCA ACCCTTACGA 420 

GAATTCATCC NCAGTAAATT TGGATATNAC 480 

500 



Fig. 12 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B3CA3c 



ACTGATGGAT GTCGCCGGAG GCGAGGGGCC TTATCTGATG CTCGGCTGCC TGTTCGTGAT 60 

GTGCGCGGCG ATTGGGCTGT TTATCTCAAA CACCGCCACG GCGGTGCTGA TGGCGCCTAT 180 

TGCCTTAGCG GCGGCGAAGT CAATGGGCGT CTCACCCTAT CCTTTTGCCA TGGTGGTGGC 180 

GATGGCGGCT TCGGCGGCGT TTATGACCCC GGTCTCCTCG CCGGTTAACA CCCTGGTGCT 240 

TGGCCCTGGC AAGTACTCAT TTAGCGATTT TGTCAAAATA GGCGTG 286 



Fig. 13 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B9CG1 




CAGCCCCTTC 


TTCTCAATTT CATCTGTCAC TACCCTGGTG TAGTATCTCA TAGCCTTACA 


&0 


TTTTTATAGC 


CTCCTCCCTG GTCTGTCTTT TGATTTTCCT GCCTGTAATC CATATCACAC 


120 


ATAACTGCAA 


GTAAACATTT CTAAAGTGTG GTTATGCTCA TGTCACTCCT GTGCCAAGAA 


180 


ATAGTTTCCA 


TTACCGTCTT AATAAAATTC GGATTTGTTC TTTCCTATTN TCACTCTTCA 


240 


C 




241 



Fig. 14 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B9CG3 



CAAAGCCAGT GGTTTGAGCT CTCTACTGTG 
AATGGTGGCA GGATTTTTAT TATAAACATG 
TATATTCTTC TACATTTAAA CAATAAAAAT 
TTAGGTAAGA GTGTTTAATG AGAGGGTATA 
C 



TAAACTCCTA AACCAAGGCC ATTTATGATA 60 
TACCCATGCA AATTTCCTAT AACTCTGAGA 120 
AATCTATTTT TAAAAGCCTA ATTTGCGTAG 180 
AGGTATAAAT CACCAGTCAA CGTTTCTCTG 240 

241 



Fig, 15 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B2CA2 



CGACGTCGGT AAAATCGGAC ATGAAGCCAC CGCTGGTCTT TTCGTCCGAG CGATAGGCGC 60 

CGGCCAGCCA GCGGAACGGT TGCCCGGATG GCGAAGCGAG CCGGAGTTCT TCGGACTGAG 120 

TATGAATCTT GTTGTGAAAA TACTCGCCGC CTTCGTTCGA CGACGTCGCG TCGAAATCTT 180 

CGAACTCCTT ACGATCGAAG TCTTCGTGGG CGACGATCGC GGTCAGTTCC GCCCCACCGA ?40 

AATCATGGTT GAGCCGGATG CTGCCCCCGA AGCCCT 276 



Fig. 16 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B3CA1 



CCCAGGTCAA CCAGGCTGCA ACACGCAGGT CCTTCGATTG GGCACGAAGC AGCGCTTCGC 60 

TGTTTTCCAG GATTTTCAAC CAGTCGGTCT GGCCGTTCTC ATGGAGCGAG AGCGCCTTGC 120 

CCAGCTCATT TTCCAGCGCC TCGTATTCGC TGGAAAAACG CACATCCTCA CCCGCAAAGA 180 

CATCCTTTGA AATCGGCTGT TCCGCGAGTT CCAGATANTG CGAGGAGAGC TTGCTCGAAT 240 

AGGTCATCCT AACCCTTCAA TGCACACCAT GTGCGCCAAT GAATATCTTA ACAATTCAAC 300 

TAGTTGGCAT AANAACCGAA CGAAAATCCC AATAGTCTGA AGAGCTCTTT TG 352 



Fig. 17 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B3CA2 



CTGCATGTCC ACG6CCTGGA TTTACGGGTG GTCGGCGTTC ACCCCTGGCA GCTGGCGCTC 60 

TTCCCGACCA GGCCCAGCAG GATGTGTGGG GCAAGGATAA CGGCGTGCGC ATCGCCTCGA 120 

CCTATATGCC TACTGGCAAG GCCGAGCCCG TGGAAGGCGG ATTCAGGTTC ANCGGTCGCT 180 

GGAGCTTTTC CACCGGCTCC ATGCATTGTG ACTGGCTGTT TCTAGGCGGT CTGTTGCCCA 240 

AGCGTGATGG TACGTCTGGC CTGGAGCATG TGACTTTCTG 280 



Fig. 18 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B3CA3 



AGCAAGGAGA AGGCCAAGGA GAGGCTCAAG CTGGTCCTGG CCTACGACTG GGCCAAGCTG 60 

TCGCCGGGGA TGGTGGAGAA CCTGAAGCGG GACCTCCTCG AGGTCCTCCG CCGCTACTTC 120 

TCCGTCCAGG AGGAGGGTCT TTCCGTGGTC TTGGAGGAGC GGGGGGAGAA GATNCTCCTC 180 

ATGGTCNACA TCCCCCTCAN GTGATGGTCC TGANGNGNCC CNTCCTCCTT GNCTACGATT 240 

TCGGNCTGGT GGCCCTNTTT CT p.p 



Fig. 19 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B4CA1 



AGGAGCGGGT AGAGTGGCAC CATTGAGGGG ATATTCAAAA ATATTATTTT GTCCTAAATG 60 

ATAGTTGCTG AGTTTTTCTT TGACCCATGA GTTATATTGG AGTTTATTTT TTAACTTTCC 120 

AATCGCATGG ACATGTTAGA CTTATTTTCT GTTAATGATT NCTATTTTTA TTAAATTGGA 180 

TTTGAGAAAT TGGTTNTTAT TATATCAATT TTTGGTATTT GTTGAGTTTG ACATTATAGC ?40 

TTAGTATGT 249 



Fig. 20 
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Invention 1 : 



Claim 3, claims 1,2,4-10,12,13,15,17-20,22,24,25,27-29,31,33-35,37,39, 
40,42 (parrially) : 

nucleic acids of human endogenous retroviral elemenr 
encoding breast tumor specific polypeptides 

with sequences 1,3-10,103,126-140,141 , polypeptide with amino 
acids sequence 2 , corresponding polypeptides , antibodies, epitopes , 
pharraaceutical composition , vaccines , their use for determining the 
presence , or monitoring the progresssion of breast cancer and 
diagnostic kit » 



Inventions 2-142 : 



Claims 1,2, 4-10,1.. 13, 15, 17-20, 22, 24, 25, 27-29, 31, 33-35, 37, 39, 
40,42 (parrially) : 

nucleic acids encoding breast tumor specific polypeptides 
with sequence 11 , corresponding polypeptides , antibodies, epitopes , 
pharmaceutical composition , vaccines , their use for determining the 
presence , or monitoring the progresssion of breast cancer and 
diagnostic kit . 

...ibidem for each of sequences 12-77, 142,143,146-152,154-166,168-176, 
178-192,194-198,200-204 ,206, 207,209-214,216,218,219,221-227 
separately 

Inventions 143-163 : 

Claims 11,14,16,17,19,21,23,24,26,27,30,32,36,38,39,41,43 (partially) : 
breast tumor specif c polypeptides with sequence 78 , corresponding , 
antibodies, epitoras , pharmaceutical composition and vaccines against 
breast cancer , their use for determining tbe presence , or monitoring 
the progresssion of breast cancer and diagnostic kit . 

...ibidem for each of sequences 79-86,144,145,153,167,177,193,199,205, 
208 , 215 , 217 , 220 separately 



MSOOCID: <WO_9725426A3JB> 



INTERNATIONAL SEARCH REPORT 

iitfoniuoon on patent fmmity monbcn 



Inien ml Applicatton No 

PCT/US 97/00485 



Patent document 
cited in search report 


1 Publication 
1 date 




Patent family 
mcmbcr(s) 


1 Publication 
date 


wo 9532311 


A 


30-11-95 


AU 
CA 
EP 


2690995 A 

o 1 m Ann a 

2191099 A 
0763137 A 


18- 12-95 

19- 03-97 


GB 2273G99 


A 


08-06-94 


WO 


9411514 A 


26-05-94 


WO 9519369 


A ~ 


20-07-95 


AU 


1831795 A 


01-08-95 


WO 9102062 


A 


21-02-91 


AU 
AU 
CA 
EP 
JP 


645760 6 
6413590 A 
2042064 A 
0444181 A 
4503012 T 


27-01-94 
11-03-91 
05-02-91 
04-09-91 
04-06-92 



Fonn PCT/l$A.aiO (puimi family tonci) (July 1992) 



ISDOCtD: <WO_g72542eA3JB> 



